.mm Ran'rt PfTT/Pm n 7 FF p o 9 n 2 



FORM PTO-1390 
(REV. 12-2001) 



U S DEPARTMENT OF COMMERCE PATENT At.'D TRADEMARK OFFICE 



TRANSMITTAL LETTER TO THE UNITED STATES 
DESIGNATED/ELECTED OFFICE (DO/EO/US) 
CONCERNING A FILING UNDER 35 U.S.C. 371 



ATTORNEY 'S DOCKET NUMBER 

7024-509 



U S APPLICATION NO (Ifknown, see 37 CFR 1 5 

70/049137 



INTERNATIONAL APPLICATION NO. 
PCT/US00/22725 



INTERNATIONAL FILING DATE 
18 August 2000 (18.08.00) 



PRIORITY DATE CLAIMED 
20 August 1999 (20.08.99) 



TITLE OF INVENTION 

METHODS AND COMPOSITIONS FOR REGULATING DEVELOPMENTAL IDENTITY 



APPLICANT(S) FOR DO/EOAJS 
Joseph P. OGAS; Chris R. SOMERVILLE 



Applicant herewith submits to the United States Designated/Elected Office (DO/EOAJS) the following items and other information: 

1 . [X] This is a FIRST submission of items concerning a filing under 35 U.S.C. 371. 

2. 1 | This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371. 

3. L~] This is an express request to begin national examination procedures (35 U.S.C. 371(f)). The submission must include 

items (5), (6), (9) and (21) indicated below. 

4. O The US has been elected by the expiration of 19 months from the priority date (Article 31). 

5. (X] A copy of the International Application as filed (35 U.S.C. 371(c)(2)) 

a. Q is attached hereto (required only if not communicated by the International Bureau). 

b. rj] has been communicated by the International Bureau. 

c. Jx] is not required, as the application was filed in the United States Receiving Office (ROAJS). 

6. | 1 An English language translation of the International Application as filed (35 U.S.C. 371(c)(2)). 

a. rj is attached hereto. 

b. rj has been previously submitted under 35 U.S.C. 154(d)(4). 

7. [X] Amendments to the claims of the International Aplication under PCT Article 19 (35 U.S.C. 371(c)(3)) 

a. FJ are attached hereto (required only if not communicated by the International Bureau). 

b. [X] have been communicated by the International Bureau. 

c. FJ have not been made; however, the time limit for making such amendments has NOT expired. 

d. FT] have not been made and will not be made. 

8. QD An English language translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371 (c)(3)). 

9. fX] An oath or declaration of the inventor(s) (35 U.S.C. 371(c)(4)). (unsigned) 

10. Q An English lanugage translation of the annexes of the International Preliminary Examination Report under PCT 
Article 36 (35 U.S.C. 371(c)(5)). 

Items 11 to 20 below concern document(s) or information included: 

1 | | An Information Disclosure Statement under 37 CFR 1.97 and 1.98. 

2. [H An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3 28 and 3.31 is included. 
3.1 | A FIRST preliminary amendment. 

4.0 A SECOND or SUBSEQUENT preliminary amendment. 

5.1 | A substitute specification. 

6. | j A change of power of attorney and/or address letter. 

7. fX] A computer-readable form of the sequence listing in accordance with PCT Rule 13ter.2 and 35 U S.C. 1 821 - 1.825 

8. [ | A second copy of the published international application under 35 U.S.C. 154(d)(4). 

9. CD A second copy of the English language translation of the international application under 35 U S.C 154(d)(4). 
_ Express Mail Label No.: EL551 8020441 US 

20. [X] Other items or information: Date of Deposit: FrSii 7j ■<? &£ > 2 

a. PCT Request, b. International Publication I hereby certify that this correspondence is being deposited with the United 

c. International Publication, with search report States Postal Service "Express Mail Post Office to Addressee" service under 

d. Notification of Receipt of Demand 37 CFR §1.10 on the date indicated above and is addressed to the Assistant 

e. Notification of Transmittal of the Search Report Commissioner forPatents- Was/iipgton, p.C. 20231 . 

f. Response to the International Search Report '^yx^tCt&S f7 . 



y pjN i Uj'^ i i cu i i nun, n. nubuxi i istj iu wntiun au i mu ii biynmufa a t person maiimy 



1 or fee 



^orm PCT/IB/304, 308, 332 
j. International Preliminary Examination Report 
k. Return Postcard 



U.S. APPLICATION NO*, (if known, sec 37 CFR I 5) 



37 CFR I 5) . _-5 



—j — y INTERNATIONAL APPLICATION NO 



2>BCT/US/00/22725 



ATTORNEY'S DOCKET NUMBER 

7024-509 



2 1 The following fees are submitted. 

BASIC NATIONAL FEE (37 CFR 1.492 (a) (1) - (5)): 

Neither international preliminary examination fee (37 CFR 1.482) 
nor international search fee (37 CFR 1 .445(a)(2)) paid to USPTO 
and International Search Report not prepared by the EPO or JPO. . 



International preliminary examination fee (37 CFR 1.482) not paid to 
USPTO but International Search Report prepared by the EPO or JPO 



$1040.00 



. S890.00 



International preliminary examination fee (37 CFR 1.482) not paid to USPTO 

but international search fee (37 CFR 1 .445(a)(2)) paid to USPTO $740.00 



International preliminary examination fee (37 CFR 1.482) paid to USPTO 
but all claims did not satisfy provisions of PCT Article 33(1 )-(4) 



$710.00 



International preliminary examination fee (37 CFR 1.482) paid to USPTO 

and all claims satisfied provisions of PCT Article 33(l)-(4) $100.00 

ENTER APPROPRIATE BASIC FEE AMOUNT = 



CALCULATIONS PTO USE ONLY 



100-- 



Surcharge of $130.00 for furnishing the oath or declaration later than [*~] 20 [x] 30 
months from the earliest claimed priority date (37 CFR 1.492(e)). 



130- 



CLAIMS 



NUMBER FILED 



NUMBER EXTRA 



RATE 



Total claims 



83 



20 



63 



x $18.00 



$ 1134-- 



Independent claims 



14 



3 = 



11 



x $84.00 



$ 924-- 



MULTIPLE DEPENDENT CLAIM(S) (if applicable) 



+ $280.00 



$ - 



TOTAL OF ABOVE CALCULATIONS = 



$ 2288 



fat Applicant claims small entity status. See 37 CFR 1.27. The fees indicated above 
are reduced by 1/2. . 



$ 1144- 



SUBTOTAL 



$ 1144- 



Processing fee of $130.00 for furnishing the English translation later than Q 20 Q 30 
months from the earliest claimed priority date (37 CFR 1 .492(f)). 



$ 



TOTAL NATIONAL FEE = 



$ 1 1 44- 



Fee for recording the enclosed assignment (37 CFR 1.21(h)). The assignment must be 
accompanied by an appropriate cover sheet (37 CFR 3.28, 3.31). $40.00 per property + 



$ 



TOTAL FEES ENCLOSED 



$ 1 1 44- 



Amount to be 
refunded: 



charged: 



a. A check in the amount of $ 



1144.00 



to cover the above fees is enclosed. 



b. rj Please charge my Deposit Account No 



23-3030 



in the amount of $ . 



to cover the above fees. 



A duplicate copy of this sheet is enclosed. 



c. [T] The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any 

overpayment to Deposit Account No 23-3030 . A duplicate copy of this sheet is enclosed 

d. Q Fees are to be charged to a credit card WARNING: Information on this form may become public. Credit card 

information should not be included on this form. Provide credit card information and authorization on PTO-2038. 



NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive (37 CFR 
1.137 (a) or (b)) must be filed and granted to restore the application to pending status. 



SEND ALL CORRESPONDENCE TO 



Gregory B. COY SI 
WOODARD, EMHARDT, NAUGHTON, MORIARTY & MCNETT Gregory B. COY 

Bank One Center/Tower, Suite 3700 



1 1 1 Monument Circle NAME 
Indianapolis, Indiana 46204 US #40,967 



REGISTRATION NUMBER 



FORM PTO-I 390 (REV 12-2001) pa E c2o("2 



, . n 7 FEB 7002 * '* 1 8 f Mf 1 3 ? 



WO 01/14519 iTBvMB ■ ww PCT/USOO/22725 



METHODS A imJCXWlgQSIIlQNS-H^ 
DEVELOPMENTAL IDENTITY 

5 This invention was made with government support under grant 

number DE-FG02-97ER20133 awarded by the U.S. Department of Energy, 
Office of Basic Energy Sciences. The Government has certain rights in the 
invention. 

10 CROSS-REFERENCE TO RELATED APPLICATIONS 

The present application claims the benefit of U.S. Provisional 
Application Serial Number 60/149,975, filed on August 20, 1999, which is 
hereby incorporated by reference in its entirety. 

I 

15 BACKGROUND OF THE INVENTION 

The present invention relates generally to methods of transforming 
host cells with nucleic acid encoding proteins involved in regulating 
developmental identity. For example, methods are also provided that 
include regulating embryonic identity, as well as other steps in the 

20 developmental process, especially in plants. The invention further relates 
to recombinant nucleic acid molecules, plant cells and transgenic plants 
that may be advantageously used in the methods of the present invention. 

During the final stages of embryo development in angiosperms, the 
embryo accumulates massive amounts of nutrient storage reserves and 

25 then undergoes programmed desiccation and transition to dormancy [West, 
M.A. and Harada, J. J. (1993) Plant Cell 5: 1361-1369; Goldberg, R.B. et 
al. (1994) Science 266: 605-614; Kigel, J. and Galili, G. (Eds.) (1995) Seed 
development and germination, New York, M. Dekker ; McCarty, D. R. 
(1995) Annu. Rev. Plant Phys. 46: 71-93]. The embryo may remain 

30 dormant for extended periods of time. The quiescent embryo emerges from 
dormancy and undergoes post-embryonic vegetative development in 
response to one or more endogenous and exogenous cues that may vary 
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from one species to another. The regulatory processes that control the 
transition from the late stages of embryo development to vegetative growth 
and development are poorly characterized. 

LEC1 appears to play a key role in regulating embryo development 
5 in Arabidopsis [Meinke, D. W. (1992) Science 258: 1647-1650; Meinke, D. 
W. (1994) Plant Cell 6: 1049-1064; West, M.A.L. et al. (1994) Plant Cell 6: 
1731-1745; Parcy, F. et al. (1997) Plant Cell 9:1 731 -1745; Lotan, T. (1998) 
Cell 93(7): 1 195-205]. Seeds of led mutants exhibit numerous phenotypes, 
including defects in expression of maturation-specific genes, desiccation 

10 intolerance, premature germination, and abnormal expression of 

post-embryonic characteristics in cotyledons. LEC1 encodes a transcription 
factor, the HAP3 subunit of a CCAAT box-binding factor [Lotan, T. (1 998) 
Cell 93(7): 1 195-205]. The LEC1 transcript is expressed only in seeds, and 
can be detected in the embryo as early as the two-cell stage [Lotan, T. 

15 (1 998) Cell 93(7):1 1 95-205]. Expression of the LEC1 gene in 

non-embryonic tissues is sufficient to cause expression of embryonic 
differentiation characteristics [Lotan, T. (1998) Cell 93(7): 1 195-205]. 

The ability of the growth regulator gibberellin (GA) to promote 
germination of seeds of numerous plant species has been demonstrated 

20 through the use of chemical inhibitors of GA biosynthesis and the 

characterization of mutants defective in gibberellin biosynthesis [Ritchie, S. 
and Gilroy, S. (1998) New Phytol 140:363-383]. Very little is known about 
the mechanism by which GA promotes germination. Genes that exhibit 
GA-dependent transcription are known, and the ability of GA to regulate 

25 transcription of genes in the aleurone layer of germinating cereal grains has 
been extensively characterized [Huttly, A. K. and Phillips, A. L. (1995) 
Physiol Plant 95:310-317; Jacobsen, J.V. et al. (1995) The Netherlands, 
Kluwer Academic Publishers 246-271; Ritchie, S. and Gilroy, S. (1998) 
New Phytol 140: 363-383]. However, a receptor for GA has not been 

30 identified. GA plays other well-characterized roles in plant growth and 
development in addition to its role in germination, including promotion of 
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elongation and regulation of the transition to flowering [Wilson, R. N. et al. 
(1992) Plant PhysionOO: 403-408; Finkelstein, R.R. and Zeevart, J. A. D. 
et al. (1994) Cold Spring Harbor Laboratory: 523-553; Hooley, R. (1994) 
Plant Mol. Biol. 26:1529-1555; Swain, S. M., Olszewski, N. E. (1996) Plant 

5 Physiol 1 1 2:1 1 -1 7 ; Blazquez, M. A. et al. (1 997) Development 1 24: 3835- 
3844; Blazquez, M. A. et al. (1998) The Plant Cell 10:791-800]. 

The ability to regulate developmental identity, such as embryonic 
identity, especially in plants, allows one to produce plants that have 
advantageous embryonic characteristics. For example, crops may be 

10 produced that include an economically significant quantity of oil. Moreover, 
plants that exhibit delayed flowering or reduced height may be valuable. 

Although some information regarding regulation of developmental 
identity is known in Arabidopsis thaliana, identification of other proteins 
involved in regulation of developmental identity in lower eukaryotes could 

is lead to identification of similar proteins in higher eukaryotes, including 
humans. Moreover, identification of such proteins can lead to the 
identification of substances that may work together with the aforementioned 
proteins in regulating developmental identity. There is therefore a need for 
nucleic acid sequences and proteins involved in regulating developmental 

20 identity. The present invention addresses this need. 



30 
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SUMMARY OF THE INVENTION 

A protein that functions in regulating developmental identity has 
been identified in the plant Arabidopsis thaliana. The protein is 

5 characterized by the presence of a zinc finger domain, two chromo 

domains, a helicase domain, and a DNA binding domain. This is the first 
demonstration that proteins having such features are able to regulate 
developmental identity, such as, for example, by terminating a previous 
developmental program. Accordingly, the present invention provides 

10 purified proteins having these features, including PKL (PICKLE). The 
invention further provides isolated nucleic acid molecules that include 
nucleotide sequences encoding these functional proteins. Recombinant 
nucleic acid molecules are also provided that include the nucleotide 
sequence encoding these proteins. The nucleic acid molecules may be 

is incorporated in a host cell. Methods of transforming host cells in order to, 
for example, regulate developmental identity in the cells are also provided. 

In a first aspect of the invention, a method of transforming a host cell 
is provided that includes introducing into a host cell a nucleic acid molecule 
encoding a protein having at least one chromo domain, a helicase domain 

20 and a DNA binding domain. The protein is advantageously expressed in 
an amount sufficient to regulate developmental identity. In other forms of 
the invention, a method may include introducing into a host cell a nucleic 
acid molecule encoding a protein functioning in regulating developmental 
identity wherein the nucleic acid molecule or the protein has the nucleotide 

25 or amino acid sequence, respectively, as described herein. 

In a second aspect of the invention, a method of transforming a host 
cell may include introducing into a host cell an antisense DNA or RNA 
molecule that includes a nucleotide sequence complementary to a length of 
nucleotides within either a nucleic acid molecule as described herein or 

30 within a nucleic acid molecule that encodes a protein having at least one 
chromo domain, a helicase domain, and a DNA binding domain as 
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described herein. The host cell may then be cultured under conditions 
effective for hybridization of the antisense DNA or RNA molecule to nucleic 
acid of the host to regulate developmental identity. In another form of the 
invention, in a method of transforming a host cell, an antisense nucleic acid 

5 molecule complementary to an RNA transcript is generated by introducing 
into a host cell a first nucleic acid molecule having a nucleotide sequence 
that is complementary to a nucleotide sequence having at least about 50% 
identity to a length of nucleotides within the nucleotide sequence set forth 
in SEQ ID NO:1. After generating the antisense nucleic acid molecule, the 

10 host cell is cultured under conditions effective for hybridization of the 
antisense molecule to the RNA transcript of the host cell. 

In a third aspect of the present invention, methods of expressing a 
PKL protein are provided that include introducing into a host cell the 
nucleotide sequences described herein and culturing under conditions 

15 effective to achieve expression of the protein. 

In a fourth aspect of the present invention, recombinant nucleic acid 
molecules are provided that include the nucleotide sequences encoding a 
protein as described herein along with a foreign promoter that is operably 
linked to a terminal 5' end of the nucleotide sequence. Eukaryotic host 

20 cells and transgenic plants are also provided that include the introduced 
nucleotide sequences described herein, as are recombinant proteins. 
Further provided are isolated nucleotide sequences having the nucleotide 
sequences described herein, including those encoding the domains 
described herein. 

25 It is an object of the invention to provide nucleotide sequences 

encoding proteins involved in regulating developmental identity, as well as 
the amino acid sequences encoding the proteins. 

It is a further object of the invention to provide constructs, eukaryotic 
cells and transgenic plants that include the introduced nucleotide 

30 sequences described herein. 
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it js yet another object of the invention to provide methods for 
utilizing the nucleotide and amino acid sequences described herein, 
advantageously to regulate developmental identity. 

These and other objects and advantages of the present invention 
5 will be apparent from the descriptions herein. 
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BRIEF DESCRIPTION OF THE FIGURES 

FIG 1 depicts a genetic map of the region surrounding PKL. Markers 
E11M48, E11m49 pkl, E14M59, GPA-1 and nga1126 are shown below the 
5 line, whereas the distance in cM of the locus from pkl is indicated above the 
line. The extent of YAC (CIC8H12) and BAC (T3H2) clones covering the 
region is illustrated. 

FIG 2 depicts a Southern blot performed as described in Example 1 , 
10 showing polymorphisms associated with two fast neutron-derived alleles of 
PKL. PKL (lane 1), pkl-7 (lane 2), and pkl-9 (lane 3) genomic DNA were 
digested with Xba I and probed with the Sal I fragment indicated in FIG. 3. 
The numbers to the left of the figure indicate size standards. 

15 FIG. 3 depicts a restriction map that highlights various features of 

the PKL locus as discussed in Example 1 . The relative position of four open 
reading frames (ORFs) (P450, dpB, PKL and 2-CR) are indicated as well 
as the region of genomic DNA that was found not to be altered in the fast 
neutron-derived PKL alleles pkl-7 and pkl-9. The portion of genomic DNA 

20 that was used as a probe in FIG. 2 is indicated in addition to the fragment 
that was used to complement the pkl mutant. BamHI, Sail, BstBI, and Ncol 
represent respective restriction endonuclease cleavage sites. 

FIGS. 4A and 4B depict complementation of pkl phenotype in pkl 
25 plants as discussed in Example 1. Complementation of pkl-1 seedling (FIG. 
4A) and mature pkl-1 plant (FIG. 4B) phenotype with vector carrying PKL is 
shown. For each FIG., the plant on the left is PKL, the plant in the middle is 
pk1-1, and the plant on the right is pfc7-Mransformed with pJ0634, as 
described in Example 1, which carries the PKL gene. The seedlings (FIG. 
30 4A) were grown in the presence of 10~ 8 M uniconazole-P in continuous 

light. The mature plants (FIG. 4B) were grown under 18 hour illumination. 
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FIG. 5 shows a schematic diagram illustrating the location of 
domains of sequence homology found in PKL and other CHD proteins from 
Arabidopsis and other species as discussed in Example 2. CHD3 proteins 
5 contain PHD zinc fingers whereas CHD1 proteins do not. 

FIG. 6 depicts gel analysis of a ribonuclease protection assay as 
discussed in Example 2. Ribonuclease protection assays were performed 
to determine the level of the PKL transcript in the root, rosette, 
10 inflorescence, and siliques of Arabidopsis. To demonstrate that the probe 
utilized was specific for PKL, a ribonuclease protection assay using the 
same probe was performed with RNA isolated from a wild-type plant and a 
plant carrying a deletion allele of PKL, pkl-9 (panel on right). A probe for the 
cyclophilin transcript ROC3 was used as a positive control. 

15 

FIG. 7 depicts a gel analysis of a ribonuclease protection assay, 
indicating that LEC1 is expressed in pickle roots, as discussed in Example 
3. Ribonuclease protection assays were used to determine the level of the 
LEC1 transcript in the rosette, silique, and root of wild-type plants as well 
20 as in the pickle root of pkl plants. A probe for the cyclophilin transcript 
ROC3 was used as a positive control. 

FIG. 8 depicts a gel analysis of a ribonuclease protection assay, 
indicating that LEC1 is expressed in germinating pkl seeds, as discussed in 
25 Example 3. Ribonuclease protection assays were used to determine the 

level of the LEC1 transcripts in wild-type (WT) and pkl seeds at 12, 24, and 
36 hours after imbibition in the absence or presence of uniconazole-P (IT). 
A probe for the cyclophilin transcript ROC3 was used as a positive control. 
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DESCRIPTION OF THE PREFERRED EMBODIMENTS 

For the purposes of promoting an understanding of the principles of 
the invention, reference will now be made to preferred embodiments and 

5 specific language will be used to describe the same. It will nevertheless be 
understood that no limitation of the scope of the invention is thereby 
intended, such alterations and further modifications of the invention, and 
such further applications of the principles of the invention as illustrated 
herein, being contemplated as would normally occur to one skilled in the art 

10 to which the invention relates. 

A protein that functions in regulating developmental identity has 
been identified in the plant Arabidopsis thaliana. The protein is 
characterized by the presence of a zinc finger domain, two chromo 
domains, a helicase domain, and a DNA binding domain. This is the first 

is demonstration that proteins having such features are able to regulate 
developmental identity, such as, for example, by terminating a previous 
developmental program. Accordingly, the present invention provides 
purified proteins having these features, including PKL. The invention 
further provides isolated nucleic acid molecules that include nucleotide 

20 sequences encoding these functional proteins. Recombinant nucleic acid 
molecules are also provided that include the nucleotide sequence encoding 
these proteins. The nucleic acid molecules may be incorporated in a host 
cell. In other aspects of the invention, methods of transforming host cells 
and methods of regulating developmental identity in a host cell are also 

25 provided. 

In a first aspect of the invention, purified proteins are provided that 
include at least one chromo domain, a helicase domain, and a DNA binding 
domain. In preferred forms of the invention, the protein further includes a 
at least one zinc finger domain and preferably two chromo domains, such 
30 as found in PKL, wherein the protein functions in regulating developmental 
identity. As defined herein and as known in the art, developmental identity 
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refers generally to the identity of a tissue during, or at a stage of, 
development that is brought about by expression of selected genes. For 
example, selected genes may be expressed in a plant that gives rise to 
embryonic roots, and thus the developmental identity of the root is 
embryonic. Furthermore, selected genes may be expressed in a plant that 
gives rise to seedling roots, and thus the developmental identity of the root 
is seedling. With specific reference to PKL in pickle roots, one or more 
genes that gives rise to embryonic roots and one or more genes that gives 
rise to seedling roots are expressed simultaneously, thus the 
developmental identity of the root is both embryonic and seedling. The 
polypeptides described herein are substantially pure (i.e., the proteins are 
essentially free, e.g., at least about 95% free, from other proteins with 
which they naturally occur). In one preferred embodiment, the amino acid 
sequence of a PKL protein having the domains described above, originally 
found in Arabidopsis thaliana, is set forth in SEQ ID:2. 

Although the invention is described with reference to Arabidopsis 
thaliana amino acid sequences, it is understood that the invention is not 
limited to the specific amino acid sequence set forth in SEQ ID:2. Skilled 
artisans will recognize that, through the process of mutation and/or 
evolution, polypeptides of different lengths and having differing 
constituents, e.g., with amino acid insertions, substitutions, deletions, and 
the like, may arise that are related to, or sufficiently similar to, a sequence 
set forth herein by virtue of amino acid sequence homology and 
advantageous functionality as described herein. The term "PKL protein" is 
used to refer generally to a protein having the features described herein 
and a preferred example includes a polypeptide having the amino acid 
sequence of SEQ ID NO:2. Also included within this definition, and in the 
scope of the invention, are variants of the polypeptide which function in 
regulating developmental identity, as described herein. Preferred proteins 
are recombinant proteins. 
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It is well known that organisms of a wide variety of species 
commonly express and utilize homologous proteins, which include the 
insertions, substitutions and/or deletions discussed above, and yet which 
effectively provide similar function. For example, an amino acid sequence 

5 isolated from another species may differ to a certain degree from the 

sequence set forth in SEQ ID NO:2, and yet have similar functionality with 
respect to catalytic and regulatory function. Amino acid sequences 
comprising such variations are included within the scope of the present 
invention and are considered substantially or sufficiently similar to a 

10 reference amino acid sequence. Although not being limited by theory, it is 
believed that the identity between amino acid sequences that is necessary 
to maintain proper functionality is related to maintenance of the tertiary 
structure of the polypeptide such that specific interactive sequences will be 
properly located and will have the desired activity. Although it is not 

15 intended that the present invention be limited by any theory by which it 
achieves its advantageous result, it is contemplated that a polypeptide 
including these interactive sequences in proper spatial context will have 
good activity, even where alterations exist in other portions thereof. 

In this regard, a variant of the multi-domain protein described herein, 

20 such as a PKL protein variant, is expected to be functionally similar to that 
set forth in SEQ ID NO:2, for example, if it includes amino acids which are 
conserved among a variety of species or if it includes non-conserved amino 
acids which exist at a given location in another species that expresses a 
functional PKL protein. 

25 Another manner in which similarity may exist between two amino 

acid sequences is where a given amino acid of one group (such as a non- 
polar amino acid, an uncharged polar amino acid, a charged polar acidic 
amino acid or a charged polar basic amino acid) is substituted with another 
amino acid from the same amino acid group. For example, it is known that 

30 the uncharged polar amino acid serine may commonly be substituted with 
the uncharged polar amino acid threonine in a polypeptide without 
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25 



substantially altering the functionality of the polypeptide. If one is unsure 
whether a given substitution will affect the functionality of the enzyme, then 
this may be determined without undue experimentation using synthetic 
techniques and screening assays known in the art. 

The invention therefore also encompasses amino acid sequences 
similar to the amino acid sequences set forth herein that have at least 
about 30% identity thereto and function in regulating developmental 
identity. Preferably, inventive amino acid sequences have at least about 
50% identity, further preferably at least about 70% identity, more preferably 
at least about 80% identity and most preferably at least about 90% identity 
to these sequences. 

In preferred embodiments, the invention also encompasses amino 
acid sequences similar to the amino acid sequences making up 
polypeptides having the domains described herein. For example, the 
invention encompasses amino acid sequences that have at least about 
50%, preferably at least about 70% and more preferably at least about 90% 
identity to a first chromo domain from amino acid 1 15 to amino acid 151 or 
a second chromo domain extending from amino acid 191 to amino acid 
227, at least about 50%, preferably at least about 70%, and more 
preferably at least about 90% identity to a helicase domain extending from 
amino acid 293 to amino acid 739, and at least about 50%, preferably at 
least about 70% and more preferably at least about 90% identity to a DNA 
binding domain extending from amino acid 1069 to amino acid 1095, and 
combinations thereof, all as set forth in SEQ ID NO:2. The invention further 
encompasses amino acid sequences, in addition to those amino acid 
sequences described above, that have at least about 50%, preferably at 
least about 70% and more preferably at least about 90% identity to the zinc 
finger domain amino acid sequence from amino acid 49 to amino acid 96. 

Percent identity may be determined, for example, by comparing 
sequence information using the advanced BLAST computer program, 
version 2.0, available from the National Institutes of Health. The BLAST 
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program. is based on the alignment method of Karlin and Altschul, Proc. 
Natl. Acad. Sci. USA 87:2264-68 (1990) and as discussed in Altschul, et 
al., J. Mol. Biol. 215:403-10 (1990); Karlin and Altschul, Proc. Natl. Acad. 
Sci. USA 90:5873-7 (1993); and Altschul et al. (1997) Nucleic Acids Res. 
5 25:3389-3402. Briefly, the BLAST program defines identity as the number 
of identical aligned symbols (i.e., nucleotides or amino acids), divided by 
the total number of symbols in the shorter of the two sequences. The 
program may be used to determine percent identity over the entire length of 
the proteins being compared. Known default parameters are typically 
10 used, in addition to the following user-defined parameters for the BLAST 
program, blastp: (1) Expect value of 10.0; (2) gap penalties: Existence 11, 
Extension 1 ; and (3) scores for matched and mismatched amino acids 
found in the BLOSUM62 matrix as described in Henikoff, S. and Henikoff, 
J.G. (1992) Proc. Natl. Acad. Sci. USA 89:1 091 5-1 091 9;Pearson, W.R. 
15 (1995) Prot. Sci. 4:1 145-1 160; and Henikoff, S. and Henikoff, J.G. (1993) 
Proteins 17:49-61 . The program also uses an SEG filter to mask-off 
segments of the query sequence as determined by the SEG program of 
Wootton and Federhen (1993) Computers and Chemistry 1 7: 1 49-1 63. 

In another aspect of the invention, isolated nucleic acid molecules, 
20 originally isolated from Arabidopsis thaliana, are provided that encode a 
functional PKL protein that functions in regulating developmental identity, 
especially in plants. The nucleotide sequence is set forth in SEQ ID NO:1 
wherein the coding sequence is shown from nucleotide 1 to nucleotide 
41 52 or nucleotide 41 55. It is preferred that the nucleotide sequence 
25 includes at least one of the nucleotide sequences spanning nucleotides 
343 to 453 or 571 to 681 , nucleotides 877 to 2217 and 3205 to 3285 in 
SEQ ID NO:1, which represent nucleotide sequences encoding a first 
chromo domain, a second chromo domain, a helicase domain and a DNA 
binding domain, respectively. In other forms of the invention, the 
30 nucleotide sequence further includes, in addition to the nucleotide 

sequences recited above, nucleotide sequences spanning nucleotides 145 
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to 288 in SEQ ID NO:1, which represent a nucleotide sequence encoding a 
zinc finger domain. It is not intended that the present invention be limited 
to these exemplary nucleotide sequences, but include sequences having 
substantial similarity thereto and sequences which encode variant forms of 

5 functional PKL protein as discussed above and as further discussed below. 

The term "isolated nucleic acid," as used herein, is intended to refer 
to nucleic acid which is not in its native environment. For example, the 
nucleic acid is separated from other contaminants that naturally accompany 
it, such as proteins, lipids and other nucleic acid sequences. The term 

10 includes nucleic acid which has been removed or purified from its naturally- 
occurring environment or clone library, and further includes recombinant or 
cloned nucleic acid isolates and chemically synthesized nucleic acid. 

The term "nucleotide sequence," as used herein, is intended to refer 
to a natural or synthetic sequential array of nucleotides and/or nucleosides, 

15 including deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), and 

derivatives thereof. The terms "encoding" and "coding" refer to the process 
by which a nucleotide sequence, through the mechanisms of transcription 
and translation, provides the information to a cell from which a series of 
amino acids can be assembled into a specific amino acid sequence to 

20 produce a functional polypeptide, such as, for example, an active enzyme 
or other protein that has a specific function. The process of encoding a 
specific amino acid sequence may involve DNA sequences having one or 
more base changes (i.e., insertions, deletions, substitutions) that do not 
cause a change in the encoded amino acid, or which involve base changes 

25 which may alter one or more amino acids, but do not eliminate the 

functional properties of the polypeptide encoded by the DNA sequence. 

It is therefore understood that the invention encompasses more than 
the specific exemplary nucleotide sequence of PKL. For example, nucleic 
acid sequences encoding variant amino acid sequences, as discussed 

30 above, are within the scope of the invention. Modifications to a sequence, 
such as deletions, insertions, or substitutions in the sequence, which 
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produce "silent" changes that do not substantially affect the functional 
properties of the resulting polypeptide molecule are expressly 
contemplated by the present invention. For example, it is understood that 
alterations in a nucleotide sequence which reflect the degeneracy of the 

5 genetic code, or which result in the production of a chemically equivalent 
amino acid at a given site, are contemplated. Thus, a codon for the amino 
acid alanine, a hydrophobic amino acid, may be substituted by a codon 
encoding another less hydrophobic residue, such as glycine, or a more 
hydrophobic residue, such as valine, leucine, or isoleucine. Similarly, 

10 changes which result in substitution of one negatively charged residue for 
another, such as aspartic acid for glutamic acid, or one positively charged 
residue for another, such as lysine for arginine, can also be expected to 
produce a biologically equivalent product. 

Nucleotide changes which result in alteration of the N-terminal and 

15 C-terminal portions of the encoded polypeptide molecule would also not 
generally be expected to alter the activity of the polypeptide. In some 
cases, it may in fact be desirable to make mutations in the sequence in 
order to study the effect of alteration on the biological activity of the 
polypeptide. Each of the proposed modifications is well within the routine 

20 skill in the art. 

In one preferred embodiment, the nucleotide sequence has 
substantial similarity to the sequence set forth in SEQ ID:1, especially from 
nucleotide 1 to nucleotide 4152 or 4155, preferably at least one of the 
sequences spanning nucleotides 343 to 453 or 571 to 681 , nucleotides 

25 877 to 221 7 and 3205 to 3285 in SEQ ID NO:1 , and variants described 

herein. In other forms of the invention, the nucleotide sequence, in addition 
to having substantial similarity to the above-recited sequences, further has 
substantial similarity to the nucleotide sequence spanning nucleotides 145 
to 288. The term "substantial similarity" is used herein with respect to a 

30 nucleotide sequence to designate that the nucleotide sequence has a 

sequence sufficiently similar to a reference nucleotide sequence that it will 
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hybridize therewith under moderately stringent conditions. This method of 
determining similarity is well known in the art to which the invention 
pertains. Briefly, moderately stringent conditions are defined in Sambrook 
et al. (Eds.), Molecular Cloning: A Laboratory Manua\, 2nd ed. Vol. 1 , pp. 
5 101-104, Cold Spring Harbor Laboratory Press (1989) as including the use 
of a prewashing solution of 5X SSC (a sodium chloride/sodium citrate 
solution), 0.5% sodium dodecyl sulfate (SDS), 1.0 mM ethylene 
diaminetetraacetic acid (EDTA) (pH 8.0) and hybridization and washing 
conditions of 55°C, 5x SSC. A further feature of the polynucleotide is that it 

10 encodes a polypeptide having similar functionality to the PKL protein 
described herein, i.e., functioning to regulate developmental identity. 

In yet another embodiment, nucleotide sequences having selected 
percent identities to the nucleotide sequence set forth in SEQ ID:1 , 
especially with respect to the coding sequence from nucleotide 1 to 

15 nucleotide 4152 or nucleotide 4155 are provided. In one preferred form, 
nucleotide sequences are provided that have at least about 50% identity, 
preferably at least about 60% identity, more preferably at least about 80% 
identity, and most preferably at least about 90% identity to the nucleotide 
sequence set forth in SEQ ID:1 , especially from nucleotide 1 to nucleotide 

20 4152 or nucleotide 4155. In other forms of the invention, nucleotide 

sequences are provided that have at least about 50%, preferably at least 
about 60% identity, more preferably at least about 80% identity, and most 
preferably at least about 90% identity to a nucleotide sequence spanning 
nucleotides 145 to 288, at least one of the sequences spanning nucleotides 

25 343 to 453 or 571 to 681 , nucleotides 877 to 2217 and 3205 to 3285 in 

SEQ ID NO:1 . A further feature is that the nucleotide sequence set forth in 
SEQ ID:1 encodes a protein that functions in regulating developmental 
identity. 

The percent identity may be determined, for example, by comparing 
30 sequence information using the advanced BLAST computer program, 
version 2.0, as described above with reference to amino acid identity. 
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Known default parameters are typically used, in addition to the following 
user-defined parameters for blastn: (1) gap penalties: Existence 1 1 , 
Extension 1 ; and (2) scores for matched and mismatched nucleotides 
found in the blastn matrix as described in Altschul, S.F. et al. (1997) 
5 Nucleic Acids Res. 25:3389-3402 and Zhang, J. (1997) Genome Res. 
7:649-656. 

A suitable DNA sequence may be obtained by cloning techniques 
using cDNA libraries. For example, Arabidopsis thaliana cDNA libraries are 
available commercially or may be constructed using standard methods 

10 known in the art. Suitable nucleotide sequences may be isolated from DNA 
libraries obtained from a wide variety of species by means of nucleic acid 
hybridization or polymerase chain reaction (PCR) procedures, using as 
probes or primers nucleotide sequences selected in accordance with the 
invention, such as those set forth in SEQ ID:1 , nucleotide sequences 

is having substantial similarity thereto, or portions thereof. 

Alternately, a suitable sequence may be made by other techniques 
which are well known in the art. For example, nucleic acid sequences 
encoding a functional PKL protein, or variant thereof, may be constructed 
by recombinant DNA technology, for example, by cutting or splicing nucleic 

20 acids using restriction enzymes and DNA ligase. Furthermore, nucleic acid 
sequences may be constructed using chemical synthesis, such as solid- 
phase phosphoramidate technology. PCR may be used to increase the 
quantity of nucleic acid produced. Moreover, if the particular nucleic acid 
sequence is of a length which makes chemical synthesis of the entire 

25 length impractical, the sequence may be broken up into smaller segments 
which may be synthesized and ligated together to form the entire desired 
sequence by methods known in the art. 

In another aspect of the invention, PKL polypeptides functioning in 
regulating developmental identity and having the amino acid sequences 

30 encoded by nucleotide sequences having substantial similarity to the 
nucleotide sequences described above are also provided. 
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In a further aspect of the invention, recombinant nucleic acid 
molecules, or recombinant vectors, are provided. In one embodiment, the 
nucleic acid molecules include a nucleotide sequence that has the selected 
percent identities, or substantial similarity, both as described herein, to the 

5 nucleotide sequence, or selected regions thereof, set forth in SEQ ID NO:1 . 
In other forms of the invention, the nucleic acid molecules include a 
nucleotide sequence encoding a functional PKL protein. The protein 
produced has the amino acid sequence set forth in SEQ ID:1 , or variants 
thereof as described above. 

o Recombinant vectors may be constructed by incorporating the 

desired nucleotide sequence within a vector according to methods well 
known to the skilled artisan and as described for example, in Sambrook et 
al. (Eds.), Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold 
Springs Laboratory, Cold Spring Harbor, New York (1989). A wide variety 

1 5 of vectors are known that have use in the invention. For example, various 
plasmid and phage vectors are known that are ideally suited for use in the 
invention. For example, pBluescript, pGEM and pUC may be used in the 
invention. In preferred embodiments wherein the host cells are plants, the 
vector may be a T-DNA vector. Representative T-DNA vector systems are 

20 discussed in the following publications: An et al., (1986) EMBOJ. 4:277; 
Herrera-Estrella et al., (1983) EMBO J. 2:987; Herrera-Estrella et al., 
(1985) in Plant Genetic Engineering, New York: Cambridge University 
Press, p. 63. 

In one embodiment, the desired recombinant vector may be 
25 constructed by ligating DNA linker sequences to the 5' and 3' ends of the 
desired nucleotide insert, cleaving the insert with a restriction enzyme that 
specifically recognizes sequences present in the linker sequences and the 
desired vector, cleaving the vector with the same restriction enzyme, 
mixing the cleaved vector with the cleaved insert and using DNA ligase to 
30 incorporate the insert into the vector as known in the art. 
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The vectors may include other nucleotide sequences, such as those 
encoding selectable markers, including those for antibiotic resistance or 
color selection. The vectors also preferably include a promoter nucleotide 
sequence. The desired nucleic acid insert is preferably operably linked to 

5 the promoter. A nucleic acid is "operably linked" to a another nucleic acid 
sequence, such as a promoter sequence, when it is placed in a specific 
functional relationship with the other nucleic acid sequence. The functional 
relationship between a promoter and a desired nucleic acid insert typically 
involves the nucleic acid and the promoter sequences being contiguous 

io such that transcription of the nucleic acid sequence will be facilitated. Two 
nucleic acid sequences are further said to be operably linked if the nature 
of the linkage between the two sequences does not (1) result in the 
introduction of a frame-shift-mutation; (2) interfere with the ability of the 
promoter region sequence to direct the transcription of the desired 

15 nucleotide sequence, or (3) interfere with the ability of the desired 

nucleotide sequence to be transcribed by the promoter sequence region. 
Typically, the promoter element is generally upstream (i.e., at the 5' end) of 
the nucleic acid insert coding sequence. 

A wide variety of promoters are known in the art, including cell- 

20 specific promoters, inducible promoters, and constitutive promoters. The 
promoters may further be selected such that they require activation by 
activating elements known in the art, so that production of the protein 
encoded by the nucleic acid sequence insert may be regulated as desired. 
Preferred promoters are foreign promoters. A "foreign promoter" is defined 

25 herein to mean a promoter other than the native, or natural, promoter which 
promotes transcription of a length of DNA. 

The promoters may be of viral, bacterial or eukaryotic origin, 
including those from plants, plant viruses and animals. As an example, the 
promoter may be of viral origin, including a cauliflower mosaic virus 

30 promoter (CaMV), such as CaMV 35S or 19S, a figwort mosaic virus 

promoter (FMV 35S), or the coat protein of tobacco mosaic virus (TMV). 
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Promoters of bacterial origin include the octopine synthase promoter, the 
nopaline synthase promoter and other promoters derived from native Ti 
plasmid as discussed in Herrera-Estrella et al., Nature, 303:209-213 
(1983). Promoters of animal origin include SV40 and CMV. 

The vectors may further include other regulatory elements, such as 
enhancer sequences, which cooperate with the promoter to achieve 
transcription of the nucleic acid insert coding sequence. By "enhancer" is 
meant nucleotide sequence elements which can stimulate promoter activity 
in a cell, such as a bacterial or eukaryotic host cell. 

Moreover, the vectors may include another nucleotide sequence 
insert that encodes a protein that may aid in purification of the desired 
protein encoded by the desired nucleotide sequence. The additional 
nucleotide sequence is positioned in the vector such that a fusion, or 
chimeric, protein is obtained. For example, a PKL protein may be 
produced having at its C-terminal end linker amino acids, as known in the 
art, joined to the other protein. The additional nucleotide sequence may 
include, for example, the nucleotide sequence encoding glutathione-S- 
transferase (GST). After purification procedures known to the skilled 
artisan, the additional amino acid sequence is cleaved with an appropriate 
enzyme. For example, if the additional amino acid sequence is that of 
GST, then thrombin is used to separate the PKL protein from GST. The 
PKL protein may then be isolated from the other proteins, or fragments 
thereof, by methods known in the art. 

The recombinant vectors may be used to transform a host cell. 
Such methods include, for example, those described in Sambrook et al. 
(Eds.), Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Springs 
Laboratory, Cold Spring Harbor, New York (1989). Once the desired 
nucleic acid has been introduced into the host cell, the host cell may 
produce the PKL protein, or variants thereof, as described above. 
Accordingly, in yet another aspect of the invention, a host cell is provided 
that includes the recombinant vectors described above. 
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A wide variety of host cells may be used in the invention, including 
prokaryotic and eukaryotic host cells. Bacterial host cells such as 
Escherichia coli, HB 101 and XL-1 blue may be advantageously used in the 
present invention. Typical eukaryotic host cells include animal host cells, 

5 such as NIH 3T3, NIH 293, COS, PCK and HeLa, and plant host cells, such 
as Arabidopsis, maize and tobacco protoplasts. 

In yet another aspect of the invention, methods of producing 
functional PKL proteins as described above are provided. In one 
embodiment, the method includes providing a nucleotide sequence 

io described above, or variants thereof, that encodes a functional PKL protein 
that regulates developmental identity in a host cell, and introducing the 
nucleotide sequence into a host cell, as described above. The desired 
nucleotide sequence may be advantageously incorporated into a vector to 
form a recombinant vector. The recombinant vector may then be 

15 introduced into a host cell according to known procedures in the art. Such 
host cells are then cultured under conditions, well known to the skilled 
artisan, effective to achieve expression of the PKL polypeptide. The PKL 
polypeptide may then be purified using conventional techniques. 

In a further aspect of the invention, methods for transforming a host 

20 cell, which preferably allows for regulation of developmental identity, are 

provided. In one form of the invention, a method includes introducing into a 
host cell a nucleic acid molecule encoding a protein having at least one 
chromo domain, a helicase domain and a DNA binding domain, wherein 
the protein functions in regulating developmental identity. In preferred 

25 embodiments, the protein further may include at least one zinc finger 
domain, and further preferably includes two chromo domains. In more 
preferred embodiments, the protein is PKL, or a PKL variant, as described 
herein. The various domains may be encoded by a nucleotide sequence 
having selected percent identities, or substantial similarity, both as defined 

30 above, to the nucleotide sequence set forth in SEQ ID NO:1 , or portions 
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thereof as described herein. The host cell may be cultured under 
conditions effective for production of said protein. 

In preferred embodiments, an amount of protein is produced that is 
effective in regulating developmental identity. For example, the protein 

5 may regulate the transition from embryonic to post-embryonic 

development. In plants, for example, the protein preferably regulates the 
transition from an embryonic state to a seedling state. Although not being 
limited by theory, it is believed that PKL, or variants thereof, may act as a 
chromatin remodeling factor to repress transcription of LEC1 , a protein that 

10 plays a role in regulating embryo development in Arabidopsis thaliana. 

In yet other forms of the invention, the method described above may 
include introducing into the host cell a nucleotide sequence encoding the 
various domains discussed above that have at least the selected percent 
identities to the amino acid sequence set forth in SEQ ID NO:1 described 

is herein. 

Although the methods described herein may be performed to 
promote the transition from an embryonic state to a post-embryonic state, it 
may be advantageous in performing the methods described herein to allow 
the embryonic state to perpetuate after germination by altering the activity, 

20 or decreasing the production of, the protein. For example, inactivation of 
PKL, or variants thereof, in crops with large roots, such as radishes or 
turnips, may lead to production of roots that contain an economically 
significant amount of oil. Moreover, such inactivation may also lead to 
delayed flowering in plants, or to reduced height or expression of 

25 vegetative characteristics in plants, including inflorescences. In animal 

cells, especially mammalian cells such as human cells, altering the activity 
of PKL may aid in expressing particular differentiation attributes and 
regulation of PKL activity may have therapeutic value in human disease. 
As another example, regulation of PKL activity may be a convenient 

30 method to immortalize cells by inducing expression of stem cell 

differentiation characteristics. Alternatively, PKL genes may be potential 
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oncogenes, and loss of their function may lead to cells inappropriately 
expressing stem cell characteristics. Similarly, some teratomas may be 
caused by inactivation of PKL genes, causing the inappropriate expression 
of various differentiation programs. 

Accordingly, in other forms of the invention, proteins are provided 
having the features described herein that are modified so that the 
embryonic state may be maintained after entry into the post-embryonic 
state. In one form of the invention applied to, for example, plant host cells, 
a method of regulating developmental identity may include in vivo 
mutagenesis of the gene present in the host genome that encodes the 
protein described herein in order to alter its activity to provide the desired 
results. For example, a plant may be mutated by methods known to the 
skilled artisan, including chemical methods and homologous recombination 
methods. Moreover, other methods include use of interference RNA, T- 
RNA and fast-neutron mutagenesis. All of these methods are well known 
to the art, and may be found, for example, in Koncz et al. (Eds.) Methods in 
Arabidopsis Research , World Scientific Publishing Co. (1992). 

In yet other forms of the invention, one of the domains, or other 
regions of the proteins described herein, may be deleted in order to 
inactivate, or otherwise decrease the activity of, the PKL protein produced. 
It is realized that all, or a portion of one or more domains may be deleted 
by methods that include PCR mutagenesis and recombinant DNA 
technology, as known in the art and as exemplified in, for example, 
Sambrook et al. (Eds.), Molecular Cloning: A Laboratory Manual, 2nd ed. f 
Cold Spring Harbor Laboratory Press (1989). 

In yet other forms of the invention, a method of transforming a host 
cell, preferably to regulate developmental identity, includes introducing into 
a host cell an antisense nucleotide sequence having a nucleotide sequence 
complementary to a length of nucleotides within a nucleic acid molecule as 
described herein. For example, the nucleic acid molecule may encode a 
protein having at least one chromo domain, a helicase domain, and a DNA 
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binding domain, or other protein as described herein, such as one having 
an amino acid sequence having the selected percent identities to the 
various domains in SEQ ID NO:2 as described herein, including the zinc 
finger domain. The cell is typically cultured for a time period and under 
conditions effective for hybridization of the antisense nucleic acid sequence 
to nucleic acid of the host. The antisense nucleic acid sequence may be 
DNA or RNA. The length of nucleotides the antisense nucleotide sequence 
may be complementary to is typically a length sufficient for hybridization to 
the target nucleic acid sequence so that transcription and/or translation will 
be substantially inhibited and/or production of a functional protein will be 
substantially stopped or otherwise substantially decreased. For example, 
antisense nucleotide sequence may be at least about 25 nucleotides long, 
and may further be about 50 to about 4200 nucleotides long, preferably 
about 100 to about 1000 nucleotides long, and further more preferably 
about 200 to about 500 nucleotides long. In preferred forms of the 
invention, the antisense nucleic acid sequence may be complementary to, 
for example, a region from about nucleotide 2 to about nucleotide 331 set 
forth in SEQ ID NO:1 . In other preferred forms of the invention, the 
antisense nucleic acid sequence may be complementary to a region from 
about nucleotide 3330 to about nucleotide 3710 in SEQ ID NO:1 . 

In yet another form of a method of transforming a host cell, a method 
may include introducing into the host cell a vector that includes a nucleic 
acid molecule that may be used to generate a nucleic acid molecule, such 
as an antisense RNA molecule, that will bind to the endogenous transcript 
in order to inhibit translation of the transcript and to target the transcript for 
degradation. In one form, the method may include introducing into the 
host cell a vector that includes length of nucleotides within the nucleotide 
sequence shown in SEQ ID NO:1 along with the same nucleotides in an 
antisense orientation. As an example, the host cell may be transformed 
with a construct that includes, in the following order, a promoter, operably 
linked to, for example, a PKL fragment as described herein in the sense 
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orientation, an intron, the same PKL fragment in an antisense orientation 
and a terminator, as described in Example 5. A double-stranded transcript 
may be generated in the host cell after the intron is spliced out, which may 
then generate a complementary RNA molecule through a double-stranded 

5 RNA-dependent RNA polymerase. This complementary RNA molecule 
may then bind to the endogenous transcript, such as the messenger RNA 
(mRNA), and target it for degradation, as known in the art. 

Reference will now be made to specific examples illustrating the 
molecules, cells and methods above. It is to be understood that the 

0 examples are provided to illustrate preferred embodiments and that no 
limitation to the scope of the invention is intended thereby. 

EXAMPLE 1 
Cloning of PKL 

5 Plant Material and Media 

The pkt-1 mutation was isolated from an EMS-mutagenized 

population of the Col ecotype [Ogas, J. et al. (1997) Science 277: 91-94]. 

The pkl-7, pkl-8, and pkf-9 alleles were isolated from a fast 

neutron-mutagenized population of the Col ecotype that was obtained from 
o Lehle Seeds (http://www.arabidopsis.com/ cat. # M2F-01 A-04). Plants were 

grown as described previously [Ogas, J. et al. (1997) Science 277: 91-94]. 

Cloning of PKL 

pkl-1 plants of the Col ecotype were crossed to plants of the 
5 Landsberg erecta type to generate a mapping population, and 300 F2 
progeny expressing the pickle root phenotype were isolated. DNA was 
isolated from these progeny using a protocol described by [Liscum, E. and 
Oeller, P. W. (1999) Genome Analysis, P. Offner (Ed.), CRC Press, Boca 
Raton, FL, in press]. The SSLP markers used are described at 
o http://genome.bio.upenn.edu/ SSLPJnfo/ SSLP.html, and the PCR 

analysis of the markers was done as previously described [Bell, C. J. and 



PCTAB00/22725 
IPEA/US 2 8 AUG 200? 



26 

Ecker, J. R. (1994) Genomics 19:137-144]. The AFLP analysis was 
performed as described by Liscum, E. and Oeller, P. W. (1999) Genome 
Analysis, P. Offner (Ed.),. CRC Press, Boca Raton, FL, in press]. The 
AFLP primers used for mapping analysis were as follows: the basic EcoRI 
primer is 5'-AGA CTG CGT ACC ATT TCx y-3' (where x and y indicate 
base pairs added for specificity), shown in SEQ ID NO:3, and the basic 
Msel primer is 5-GAT GAG TCC TGA GTA Axy z-3' (where x, y, and z 
indicate base pairs added for specificity), shown in SEQ ID NO:4. E1 1M48 
denotes the primer pair EcoRI-AA and MselCAC, E1 1M49 denotes the 
primer pair EcoRI-AA and Msel-CAG, and EI4M59 denotes the primer pair 
EcoRI-AT and MselCTA [Alonso-Blanco, C. et al. (1998) Plant J. 14: 259- 
271]. 

To identify polymorphisms in the fast neutron-derived alleles of PKL, 
Southern blots were performed using genomic DNA from plants and 
digoxigenin-labeled probes that were generated from YAC DNA using 
AFLP technology. DNA from YAC CIC8H12 (YAC CIC8H12 was obtained 
from the Arabidopsis Biological Resource Center, Columbus, Ohio) and 
was prepared as described [Gibson, S. I. and Somerville, C. (1992) World 
Scientific: 1 19-143]. Approximately 50 ng of CIC8H12 DNA was utilized in a 
restriction and ligation reaction as described at 
http://carnegiedpb.stanford.edu/methods/aflp.htmi, with the following 
differences: the DNA was only digested with Msel, and only the Msel 
adaptor was ligated on. Five \i\ of this restriction and ligation (RAL) mixture 
was then used in a 100 jxl digoxigenin-labeling PCR reaction (Roche 
Biochemicals, cat. # 1 636 090) with 100 pmol each of 6 Msel-xy primers 
(where x and y indicate base pairs added for specificity). The entire PCR 
reaction was then used to probe a Southern blot as described in the Dig 
User's Guide (Roche Biochemicals, cat. # 1 438 425). Random 
combinations of 6 Msel-xy primers were used to screen for polymorphisms 
in the fast neutron-derived alleles. Polymorphisms were revealed when the 
following 6 primers were utilized: xy = CT, GG, GC, AG, TG, AT. 
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To identity a bacteria) artificial chromosome (BAC) that spanned the 
PKL locus, Southerns were performed using BAC filters and a probe 
generated from the AFLP marker E1 1M49. BAC filters representing the 
Arabidopsis genome were obtained from the Arabidopsis Biological 
5 Resource Center at Ohio State University (stock # CD4-25F). Southern 
blots were performed as described in the Dig User's Guide (Roche 
Biochemicals, cat. # 1 438 425). BAC T3H2 was identified as a positive, 
and DNA was isolated using a midiprep kit and protocol from Qiagen (cat. # 
12143). Approximately 5 ng of T3H2 DNA was utilized to generate a 
10 DIG-labeled AFLP probe as described above for CIC8H12. The same 6 
primers that identified polymorphisms with CIC8H12 also gave 
polymorphisms with T3H2. Bands that were polymorphic in fast 
neutron-derived alleles of PKL were then subcloned from T3H2 and the 
DNA sequence was determined using an ABI 310. 

15 

Complementation of pkl mutant 

A BstBI - Ncol 1 1 .9 kb genomic fragment that spanned the predicted 
CHD gene was subcloned into the plant transformation vector pCambia 
3300 (CSIRO, Canberra) using the BstXI and Xbal sites to generate 

20 pJ0634. pkl-1 and pkl-7 plants were transformed with both empty vector 
and pJ0634 using an in planta transformation protocol with the 
Agrobacterium tumefaciens strain GV3101 [Bechtold, N. et al. (1993) C. R. 
Acad, Sci. Paris 31 6: 1 1 94-1 1 99]. Basta was used to select for 
transformants of T1 progeny. Only pkl plants transformed with pJ0634 were 

25 complemented for the pkl phenotype. The T2 progeny of two independent 
transformants that exhibited a complemented phenotype were examined 
for cosegregation of basta-resistance and complementation. For both lines, 
basta-resistance and complementation cosegregated indicating that 
complementation was due to introduction of the wild-type PKL gene. 

30 Results 
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Fast neutron-derived alleles of PKL were identified to facilitate the 
cloning of PKL by map-based methods. Fast neutron mutagenesis 
generates mutations that consist of chromosomal deletions at a high 
frequency [Bruggemann, E. et al. (1996) Plant J. 10: 755-760]. 
5 Approximately 50,000 fast neutron-mutagenized M 2 seed were screened 
for the pickle root phenotype in the presence of 10~ 8 M uniconazole-P, a GA 
biosynthetic inhibitor [Izumi, K. et al. (1985) Plant Cell Physiol. 26: 821- 
827] that increases penetrance of the pickle root phenotype [Ogas, J. et al. 
(1997) Science 277:91 -94]. Three independent pkl mutants were identified 
io and utilized as described below. 

The pkl mutation was genetically mapped relative to previously 
mapped polymorphisms between the Col and Ler ecotypes of Arabidopsis. 
Plants carrying the pkl-1 allele in the Col ecotype were crossed to wild-type 
Ler plants and 300 F 2 progeny expressing the pickle root phenotype were 
isolated. DNA from the 300 pkl F 2 was used to localize the pkl-1 mutation by 
interval mapping using SSLP markers [Bell, C. J. and Ecker, J. R. (1994) 
Genomics 1 9: 137-144]. The pkl mutation mapped to chromosome 2 near 
the ngal 126 marker (FIG. 1). Based on the analysis of 231 F 2 progeny, the 
pkl-l mutation mapped to within 1 .1 cM of the SSLP marker GPA-I which 
had been anchored on the physical map of chromosome 2 [Wang et al. 
(1997) Plant J. 12:711-730]. Further analysis of the 231 F 2 progeny 
revealed that the AFLP markers [Prabhu, R. R. and Gresshoff, P. M. (1994) 
Plant Mol. Biol. 26: 105-1 16; Alonso-Blanco, C. et al., (1998) Plant J. 14: 
259-271] E1 1 M48 and E14M59 flanked pkl- J and were tightly linked (FIG. 

1)- 

Based on the position of pkl on the physical map of chromosome 2 
[Wang, M. L. et al. (1997) Plant J. 12: 711-730], YAC CIC8H12 was 
selected for further analysis. PCR analysis revealed that CIC8H12 
contained the flanking markers E11M49 and E14M59 (FIG. 1), indicating 
that CIC8H12 spanned the PKL locus (data not shown). Five pools of 
random probes were generated from CIC8H12 by a PCR-based method. 
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These random probe mixtures were then used to probe Southern blots of 
genomic DNA isolated from wildtype plants and the three pkl lines 
generated by fast neutron mutagenesis. One of the probes revealed 
polymorphic bands associated with 2 of the 3 fast neutron alleles (data not 
5 shown). 

We also screened for a BAC clone that spanned the pkl locus. The 
AFLP marker E1 1 M49, which mapped 0.23 cM from pkl, was cloned and 
then used to probe BAC filters covering the Arabidopsis genome [Woo, S. - 
S. et al. (1994) Nucleic Acids Res. 22: 4922-4931 ; Choi, S. D. et al. (1995) 

o Weeds World 2: 17-20]. Several BACs that hybridized to the clone were 
identified. Restriction analysis of these BACs revealed that BAC T3H2 was 
likely to span the pkl locus. T3H2 contained restriction fragments that were 
identical in size to the restriction fragments from wild type that were 
polymorphic in the fast neutron lines. A random probe mixture was 

5 generated from T3H2 by PCR utilizing the same pool of primers used to 
generate a random probe mixture from YAC CIC8H12. This probe mixture 
from T3H2 identified the same polymorphic bands in the fast neutron lines 
as the probe mixture from CIC8H12 (data not shown). 

The nature of the lesions in the fast neutron lines was characterized 

o in greater detail using specific probes generated from T3H2. Various DNA 
fragments from T3H2 were subcloned and used as probes on Southern 
blots of Arabidopsis genomic DNA. One of these Southerns is shown in 
FIG 2. The probe used for this blot was a 10 kb Sal I fragment indicated in 
FIG 3. Lanes 1, 2, and 3 contain genomic DNA digested with Xba I that 

5 was isolated from wild-type plants, fast neutron allele pkl-7, and fast 

neutron allele pkl-9, respectively. Polymorphic bands are seen in pkl-7 and 
pkl-9. Based on Southerns such as the one shown in FIG. 2, the extent of 
the alterations in the genomic DNA in pkl-7 and pkl-9 was deduced to be as 
shown in FIG 3. The mutation in pkl-7 is caused by either a translocation or 

o an insertion whereas the mutation in pkl-9 is caused by a large deletion. 
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Sequencing of the wild-type genomic DNA surrounding the pkl-7 
polymorphism indicated that only one gene is disrupted in both the pkl-7 
and pkl-9 mutants. A 3.0 kb BamHI fragment of genomic DNA that was 
polymorphic in the pkl-7 Yme was sequenced and a portion of a potential 
5 gene encoding a putative CHD protein was identified. Since CHD proteins 
can be greater than 2000 amino acids in length, 17 kb of genomic DNA 
was sequenced to ensure that the entire potential CHD gene was 
sequenced. The Genbank database was searched with the sequenced 17 
kb region using the program BLASTX [Altschul, S. F. et al. (1997) Nuc. 

10 Acids Res. 25: 3389-3402], which translates the DNA of interest in all 6 
reading frames and compares the translations to the protein database. 
Based on this database search, the sequenced 17 kb region contains all or 
part of 4 genes, as indicated in FIG. 3. These 4 genes have sequence 
similarity to a cytochrome P450 monooxygenase, a dpB protease, a CHD 

15 family member, and a 2-component regulator [Ogas, J. et al. (1997) 

Science 277: 91 -94], Only the gene coding for the CHD family member is 
disrupted in both the pkl-7 and pkl-9 mutants (FIG. 3). 

Complementation analyses confirmed that PKL has been cloned. A 
binary vector, pJ0634, carrying an 1 1 .9 kb BstBI - Ncol genomic fragment 

20 that spans the predicted CHD gene (FIG. 3) was constructed and 
transformed into pkl plants, pkl plants transformed with pJ0634 are 
complemented for all p/c/-related phenotypes (FIG. 4), whereas pkl plants 
transformed with the vector alone are not (data not shown). Segregation 
analyses was done on two independent lines transformed with pJ0634 to 

25 confirm that the ability to suppress the pkl mutant phenotype cosegregated 
with the transgene (data not shown). 
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EXAMPLE 2 



Characterization of PKL 



15 



20 



25 



Ribonuclease protection assays. 

Ribonuclease protection assays were performed using the RPA III 
kit from Ambion (cat. # 1414). To generate a PKL-specific probe, a DNA 
fragment was generated via RT-PCR using the primers JOpr244 (5'-TGT 
TGA GCC AGT TAT TCA CGA-3'), (nucleotides 1725-1745 in SEQ ID 
NO:1) shown in SEQ ID NO:5, and JOpr247 (5'-ACC TTT CCA TCA ATT 
CGC TCG-3 ) (sequence complementary to nucleotides 1934-1914 in SEQ 
ID NO:1) shown in SEQ ID NO:6, and subcloned using the pGEM-T vector 
system (Promega, cat. # A3600) in an orientation such that the T7 
promoter would produce an anti-sense transcript. This plasmid was called 
pJ0657. To generate a LEC7-specific probe, a DNA fragment was 
generated via PCR using the primers JOpr273 

(5'CCGCTCGAGAACCCCAATGACCAGCTCAGT-3'), shown in SEQ ID 
NO:7 (the first 3 nucleotides are used as spacers so the restriction enzyme 
will cut properly, the next 6 nucleotides represent the Xhol recognition 
sequence and the last 21 nucleotides are nucleotides 33-53 of LEC1 cDNA 
sequence, Genbank Accession No. AF036684), and JOpr262 (5'- 
CCTTCTTCACTTATACTGACC-3 , ) I shown in SEQ ID NO:8 (sequence 
complementary to nucleotides 672-652 of LEC1 cDNA sequence, Genbank 
Accession No. AF036684), digested with Xhol and Kpnl and subcloned into 
pBluescript SK cut with Xhol and Kpnl to produce pJ0660. To generate a 
ROC3-specific probe, a DNA fragment was generated via PCR using the 
primers JOpr276 (5'-AAGTCTACTTCG AC ATG ACCG-3') , shown in SEQ ID 
NO:9 (nucleotides 65-85 of ROC3 cDNA sequence, Genbank Accession 
No. U40399), and JOpr277 (5'-CTTCCAGAGTCAGATCCAACC-3'), shown 
in SEQ ID NO: 10 (sequence complementary to nucleotides 524-504 of 
ROC3 cDNA sequence, Genbank Accession No. U40399), and subcloned 
using the pGEM-T vector system in an orientation such that the T7 
promoter would produce an anti-sense transcript. This plasmid was called 
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pJ0662. To generate 32 P-labeled RNA probes for RPA analysis, the T7 
Maxiscript kit was used (Ambion cat. #1312) with pJ0657, pJ0660, and 
pJ0662 digested with Notl. The full-length transcripts were gel-purified to 
reduce background. For each ribonuclease protection assay, approximately 
5 2x1 0 4 CPM of probe was added to 10 u,g of total RNA [Verwoerd, T. C. et 
al. (1989) Nuc. Acids Res. 17: 2362-2362]. 

Results - Sequence Comparison 

RT-PCR was used to clone cDNA fragments representing the entire 

io predicted PKL ORF. Subsequently, a BAC that spanned the PKL locus, F1 
3D4 (Acc# AL031 369), was sequenced by another group as part of the 
ongoing effort to sequence the Arabidopsis genome. The sequences were 
identical, with the exception that some of the splice sites that were utilized 
to generate the PKL transcript were different from those predicted by the 

15 computer algorithm (the PKL cDNA sequence is deposited in Genbank, 

accession #AF1 85577). Analysis of the PKL ORF revealed that PKL codes 
for a predicted CHD3 homolog that is 1385 amino acids in length. A search 
of the Genbank database revealed that genomic sequence for another 
Arabidopsis CHD3 homolog that is located on chromosome V (Accession # 

20 AAC79140) has also been obtained by the genome project. Also, an 

Arabidopsis CHD1 homolog is located on chromosome IV (Accession # 
CAB40760). We refer to this other CHD3 homolog as PICKLE RELATED 1 
(PKR1) and the CHD1 homolog as PICKLE RELATED 2 (PKR2) 
PKL, PKRl, and PKR2 contain all of the sequence domains 

25 expected of CHD proteins [Delmas, V. et al. (1993) Proc. Natl. Acad. Sci. 
USA 90: 2414-2418; Woodage, T. et al. (1997) Proc. Natl. Acad. Sci. USA 
94: 1 1472-1 1477]. CHD proteins are defined by three domains of 
sequence similarity: a chromo (chromatin organization modifier) domain, a 
SNF2-retated helicase/ATPase domain, and a DNA-binding domain. CHD3 

30 proteins are distinguished from CHD1 proteins by the presence of another 
domain, a PHD zinc finger [Woodage, T. et al. (1997) Proc. Natl. Acad. 
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ScL USA 94: 1 1472-1 1477]. FIG. 5 is a schematic of the various domains 
found in PKL, PKR1, PKR2 and related CHD proteins. Table 1 lists the 
percent similarity between domains in PKL and related domains in the 
other proteins. 



Table 1 . Comparison of sequence identity of various domains found in 
other CHD proteins to domains found in PKL. Percent identity is indicated 
for PHD zinc fingers (PHD), chromo domains (chromo), SNF2-related 
helicase/ATPase domain (helicase), and DNA binding domain (DNA). For 
10 the PHD zinc fingers, both of the PHD zinc fingers from the other CHD3 
proteins are compared to the single PHD zinc finger from PKL. 





PHD#1 


PHD#2 


Chromo#1 


Chromo#2 


Helicase 


DNA 


Human 
CHD3 


33 


35 


32 


37 


58 


40.7 


Drosophila 
CHD3 


31.2 


38 


35 


45 


55 


44 


PRK1 




33 


24 


42 


51 


30 


PKR2 






68 


78 


75 


74 


Yeast 
CDH1 






19 


35 


49 


49 


Mouse 
CHD1 






32 


27 


50 


33 


Length 
(amino acids) 


48 


48 


37 


37-38 


452-469 


27 



Only one PHD zinc finger is found in PKL and PKRI, whereas 2 PHD zinc 
fingers are typically found in CHD3 proteins from other species. Based on 
the domains of homology identified, we have classified PKL and PKR1 as 

20 CHD3 family members and PKR2 as a CHDI family member. PKRI is 

distinguished from the other CHD3 proteins by the fact that the PHD zinc 
finger is located more towards the N-terminus of the protein than the PHD 
zinc fingers of the other CHD3 proteins. 

PKL appears more similar to the putative CHD1 protein PKR2 than 

25 the putative CHD3 protein PKR1. Several pairwise comparison programs 
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were unable to correctly align all of the various domains of PKL and PKR1 , 
whereas PKL and PKR2 were correctly aligned and exhibit 54% sequence 
identity over the entire protein. In this regard, it is interesting to note that 
the spacing between the SNF2-related helicase domain and the putative 
5 DNA-binding domain that is observed in PKL is more similar to that of 
CHD1 proteins than that of CHD3 proteins (FIG. 5). 

Results-Expression of PKL 

To determine where PKL is normally expressed, PKL transcript 

io levels were analyzed. The PKL transcript was not detected by Northern 

analysis of poly(A+) mRNA of rosette leaves. This may be due to technical 
difficulties associated with preparation of long transcripts from plant tissues 
[Roesler, K. R. et al. (1994; Plant Physiol 105: 61 1-617]. Therefore, 
ribonuclease protection assays were used to quantitate PKL mRNA (FIG. 

15 6). At this level of resolution, the PKL transcript was present at 

approximately equal levels in all tissues examined: roots (lane 1), shoots 
(lane 2), inflorescences (lane 3), and siliques (lane 4). This ubiquitous 
expression pattern is consistent with the pleiotropic shoot and root 
phenotypes exhibited by pkf plants. The PKL transcript was not detected 

20 when the ribonuclease protection assay was performed on RNA isolated 
from a plant carrying a deletion allele of PKL, pkl-9 (lanes 5 and 6). 

EXAMPLE 3 

25 Expression of LEC1 

Pickle roots are primary roots of adult plants that express embryonic 
differentiation traits such as expression of storage protein genes and 
accumulation of storage lipids [Ogas, J. et al. (1997) Science 277: 91-94]. 
30 These and other embryo-specific traits are thought to be under control of 
the LEC1 gene, which has been proposed to be a critical regulator of 
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embryonic identity [Meinke, D. W. (1992) Science 258: 1647-1650; 
Meinke, D. W. et al. (1994) Plant Cell 6: 1049-1064; West, M. A. L. et al. 
(1994) Plant Cell 6: 1731-1745; Parcy, F. etal. (1997) Plant Cell 9: 1265- 
1277; Lotan, T. et al. (1998) Ce//93(7): 1195-1205]. Therefore, the 

5 possibility that the LEC1 transcript, which is normally only expressed in 
seeds, was expressed in pickle roots was investigated. Ribonuclease 
protection assays were performed using total RNA isolated from wild-type 
roots and pickle roots with a LEC1 probe and a cyclophilin probe as a 
control (FIG. 7). As expected, the LEC1 transcript was detected in siliques 

o (lane 2) but not in rosette leaves (lane 1). Although the LEC1 transcript was 
not detected in wild-type roots (lane 3), expression of LEC1 was clearly 
detected in pickle roots (lane 4). 

Since expression of LEC1 is sufficient to induce expression of 
embryonic differentiation traits in seedlings [Lotan, T. (1998) Cell 93(7): 

is 1 195-1205], the presence of the LEC1 transcript in pickle roots suggested 
that LEC1 may play a key role in promoting expression of the pickle root 
phenotype. Penetrance of the pickle root phenotype in pkl seedlings is 
induced by treatment of seed with uniconazole-P prior to germination. If 
the level of the LEC1 transcript is the limiting factor in determining the 

20 penetrance of the pickle root phenotype, then the LEC1 transcript would be 
predicted to exhibit uniconazole-P dependent expression in imbibed pkl 
seeds. 

It was found that the LEC1 transcript was present in imbibed pkl 
seeds prior to germination (FIG. 8). Ribonuclease protection assays were 

25 performed using total RNA isolated from wild-type seed (lanes 1 -6) and pki 
seed (lanes 7-12) with a LEC1 probe and a cyclophilin probe as a control. 
Seeds were imbibed in the absence or presence of uniconazole-P for 12, 
24 or 36 hours. The LEC1 transcript is clearly present in pkl seeds at 24 
hours and 36 hours. However, the level of the LEC1 transcript was not 

30 elevated in pkl seed treated with uniconazole-P. 
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Analysis 

PKL is a CHD3 gene 

In wild-type Arabidopsis, many of the developmental pathways that 
contribute to embryo formation are not expressed in adult tissues. In pkl 
5 mutants, at least some aspects of this stage-specific control are lost; 
embryonic developmental programs such as expression of seed storage 
protein genes and genes involved in storage lipid deposition are expressed 
after germination [Ogas, J. et al. (1997) Science 277: 91-94]. Moreover, 
vegetative tissues have an abnormal capacity to spontaneously produce 

10 somatic embryos. In pkl seedlings, all organs generated during 

embryogenesis are capable of expressing embryonic identity after 
germination [Ogas, J. et al. (1997) Science 277: 91-94] (manuscript in 
preparation). By contrast, organs that arise post-embryonically, such as 
secondary roots, never express embryonic traits [Ogas, J. et al. (1997) 

15 Science 277: 91-94] (unpublished observations). Thus, PKL is apparently 
necessary to repress embryonic identity and contributes to the transition 
from embryonic to post-embryonic development. 

The identification of PKL as a gene encoding a CHD3 protein 
suggests that PKL mediates its effects on developmental identity through 

20 regulation of chromatin architecture. CHD genes have been identified in 
numerous eukaryotes, and the corresponding proteins are proposed to 
function as chromatin remodeling factors. The name "CHD" is derived from 
the three domains of sequence homology found in CHD proteins [Delmas, 
V. (1993) USA 90: 2414-2418; Woodage, T. et al. (1997) USA 94: 11472- 

25 1 1477] a chromo (chromatin organization modifier) domain, a SNF2-related 
helicase/ATPase domain, and a DNA-binding domain. Chromo domains 
are proposed to function as protein-protein interaction domains [Cowell, I. 
G. and Austin C. A. (1997) Biochim. Biophys. Acta 1337:198-206] and are 
found in numerous chromatin-associated proteins [Koonin, E. et al. (1995) 

30 Nuc. Acids Res. 23: 4229-4233]. The SNF2-related helicase/ATPase 
domain is found in numerous proteins that exhibit different activities 
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towards DNA [Eisen, J. A. et al. (1995) Nuc. Acids Res. 23: 2715-2723]. 
The SNF2-related helicase/ATPase domain found in CHD genes exhibits 
highest sequence similarity to the SWI/SNF class of transcriptional 
activators, which are proposed to remodel chromatin [Hirschhom, J. N. et 

5 al. (1992) Genes & Dev. 6: 2288-2298; Prelich, G. and Winston, F. et al. 
(1993) Genetics 135: 665-676; Imbalzano, A. N. et al. (1994) Nature 
370(6489): 481-5; Kwon, H. etal. (1994) Nature 370(6489): 477-81; 
Kruger, W. et al. (1995) Genes & Dev. 9: 2770-2779; Owen-Hughes, T. et 
al. (1996) Science 273(5274): 513-6; Logie , C. and Peterson, C. L. (1997) 

10 Embo J 1 6(22): 6772-82] by an as yet undetermined mechanism. The DNA 
binding domain of the CHD proteins is most similar to that of the telobox 
subset of Myb-related DNA-binding motifs [Woodage, T. et al. (1997) Proc. 
Natl. Acad. Sci. USA 94: 1 1472-1 1477]. Thus, CHD proteins are a unique 
juxtaposition of three domains with chromatin-related activities in a single 

15 polypeptide. 

At present, four CHD genes have been sequenced from 
Arabidopsis: PKL, PKRI, PKR2 and PKR3. CHD proteins are separated 
into two classes, CHD1 and CHD3, based on domains of homology found 
in the proteins [Woodage, T. et al. (1997) Proc. Natl. Acad. Sci. USA 94: 

20 11 472-1 1 477]. CHD3-related proteins are distinguished from CHD1 -related 
proteins by the presence of an additional domain of homology, the PHD 
zinc finger [Woodage, T. et al. (1997) Proc. Natl. Acad. Sci. USA 94: 
1 1472-1 1477]. PKL and PKR1 both have a single PHD zinc finger. Based 
on the presence of that motif, we have classified them as CHD3 proteins. 

25 This classification brings with it certain experimental predictions; CHD3 

proteins have been shown to be associated with histone deacetylases (see 
below). PKR2 and PKR3 do not have a PHD zinc finger and so we have 
classified them as CHD1 proteins. 

CHD3 proteins are thought to be involved in repression of 

30 transcription. CHD3 proteins from Xenopus and human have been show to 
be a component of a complex that contains histone deacetylase as a 
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subunit [Tong, J. K. et al. (1998) Nature 395: 917-921 ; Wade, P. A. et al. 
(1998) Curr. Biol. 8: 843-846; Zhang, Y. et al. (1998) Cell 95(2): 279-289]. 
Deacetylation of histones is correlated with transcriptional inactivation 
[Turner, B. M. (1991) J. Cell Sci. 99:13-20; Grunstein, M. (1997) Nature 
5 389:349-352; Struhl, K. (1998) Genes & Dev. 12:599-606]. Thus, by virtue 
of CHD3 proteins being a component of a histone deacetylase complex, 
they would be predicted to function as repressors of transcription. In a 
mutant of Drosophila that lacks the CHD3-related gene dMi-2, this 
prediction is borne out; homeotic genes that are normally repressed are 
10 derepressed in a dMi-2 mutant [Kehle, J. et al. (1998) Science 282(5395): 
1897-1900]. 

There is little published evidence of the function of CHD1 proteins. 
Deletion of the only CHD gene in yeast, a CHDI gene, does not result in a 
phenotype under standard growth conditions However, chdl yeast exhibit 

15 increased resistance to the pyrimidine analog 6-azauracil, a phenotype 
which is consistent with a role for CHD1 in repression of transcription 
[Woodage, T. et al. (1997) Proc. Natl Aad. Sci. USA 94: 1 1472-1 1477]. 

Based on the data presented here and previously, it is proposed 
herein that PKL also functions as a repressor of transcription. In pkl 

20 mutants, embryo-specific genes are expressed inappropriately after 
germination [Ogas, J. et al. (1997) Science 277: 91-94]. Such 
derepression could be due to loss of a shared repressor of embryo-specific 
genes or due to inappropriate expression of a general activator of the 
embryo-specific genes. LEC1 codes for a seed-specific transcription factor 

25 and is a critical activator of the embryonic developmental program [Lotan, 
T. (1998) Ce//93(7): 1195-1205]. We have shown that LEC1 is expressed 
in pkl tissue expressing embryonic differentiation characteristics after 
germination. 

Since expression of LEC1 after germination is sufficient to cause 
30 expression of embryonic differentiation characteristics [Lotan, T. (1998) 
Ce//93(7): 1195-1205], one possible model to explain expression of 
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embryonic identity after germination in pkl seedlings is that PKL is 
necessary for repression of LEC1. We found that LEC1 is expressed in pkl 
seeds prior to germination (FIG. 8), but the level of the LEC1 transcript is 
not increased in the presence of uniconazole-P. Based on what is known 
about penetrance of the pickle root phenotype, PKL, and LEC1, this result 
is consistent with a direct role for PKL in repression of LEC1 and with a 
substantive role for LEC1 in generation of the pickle root phenotype. 
However, this result is not consistent with a role for LEC1 as a 
rate-determining factor governing penetrance of the pickle root phenotype. 
In fact, the result strongly suggests that there is a separate factor that 
promotes expression of embryonic genes that is in some way repressed by 
GA. 

PKL is a component of a GA-dependent developmental switch 

Based on the characterization of the phenotype of the pkl plant 
described in this study and on the identification of PKL as a CHD3 gene, 
the following model is proposed herein to explain the role of PKL in 
regulating developmental identity during germination. Briefly, in response 
to a GA-dependent signal, PKL remodels the chromatin upstream of one or 
more genes that promote embryonic identity into a transcriptionally 
incompetent state. As a consequence of this transcriptional inactivation, 
expression of the embryonic developmental program is repressed after 
germination. In conjunction with previous observations concerning GA, the 
results in this study imply that GA plays two roles in germinating seeds of 
Arabidopsis. One well-established role is that GA triggers metabolic 
activity and activates postembryonic developmental processes. In addition, 
the results in this study indicate that GA plays a role in repression of 
embryonic developmental processes. Thus, it is proposed herein that GA 
acts as both a differentiation factor (promotion of the postembryonic state) 
and a determination factor (repression of the embryonic state) during 
germination. This result is surprising, especially in light of previous results 
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with double mutants of Arabidopsis defective in both ABA and GA 
biosynthesis [Koomeef , M. (1 982) Theor. Appl. Genet. 61 : 385-393]. Such 
mutants germinate in the absence of GA and do not inappropriately 
express embryonic differentiation characteristics after germination. One 
5 possible explanation for this apparent contradiction is that factors in 
addition to GA may be able to promote repression of embryo-specific 
genes. 

It is proposed in this study that PKL activity is, in some way, 
GA-dependent. What has been observed in this study is that pickle root 

10 penetrance is GA-dependent in the absence of PKL What this observation 
implies is that PKL and a factor whose activity is GA-dependent are 
necessary for repression of embryonic genes. The supposition that the 
activity of PKL itself is in some way GA-dependent is based on 
observations that the shoot phenotype of pkl plants is consistent with a 

15 defect in a GA signal transduction pathway [Ogas, J. et al. (1997) Science 
277: 91-94]; (manuscript in preparation). Based on the conclusion that PKL 
is functioning in a GA signal transduction pathway during shoot 
development, it is proposed herein that the activity of PKL is similarly 
regulated by GA during germination. 

20 The hypothesis that PKL remodels chromatin into a transcriptionally 

incompetent state is consistent with published data regarding CHD3 
proteins and with the pkl mutant phenotype. CHD3 proteins have been 
shown to associate with histone deacetylase [Tong, J. K. et al. (1998) 
Nature 395: 917-921 ; Wade, P. A. et al. (1998) Curr. Biol. 8: 843-846; 

25 Zhang, Y. et al. (1998) Cell 95(2): 279-289], and deacetylation of histones 
is correlated with reduced transcription [Turner, B. M. (1991) J. Cell Set. 
99: 13-20; Grunstein, M. (1997) Nature 389: 349-352; Struhl, K. (1998) 
Genes & Dev. 12: 599-606]. In a Drosophila mutant lacking the 
CHD3-related gene M-2, homeotic genes are derepressed [Kehle, J. et al. 

30 (1998) Science 282(5395): 1897-1900]. In the pkl mutant, embryonic genes 
are derepressed after germination [Ogas, J. et al. (1997) Science 277: 91- 
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94]. Here, it has shown that LEC1, a critical activator of embryonic 
development, is similarly derepressed in pickle roots and in pkl seeds prior 
to germination. 

The proposal that transcriptional inactivation of embryo-specific 
5 genes occurs after seed imbibition suggests that the final switch from 
embryonic to post-embryonic development occurs after seed maturation. 
This conclusion, in turn, suggests that seed-specific processes may be a 
developmental subset of embryo-specific processes, rather than a separate 
developmental program inserted between embryonic and post-embryonic 
io developmental programs. 

A general role for chromatin remodeling in GA signal transduction? 

pkl plants exhibit numerous pleiotropies consistent with a defect in 
GA signal transduction. The rosette leaves are dark green with shortened 

15 petioles, time to flowering is increased, apical dominance is reduced, 

anther dehiscence is delayed, and pkl shoots accumulate bioactive GAs 
(Ogas, J. et al. (1997) Science 277: 91-94]; (manuscript in preparation). In 
addition, combining the pkl mutation with a gal mutation, which also 
perturbs GA signal transduction (Koorneef, M. et al. (1985) Theor. Appl 

20 Genet 61 : 385-393; Talon, M. et al. (1990) Planta 182: 501-505; Wilson, R. 
N. and Somerville, C. (1992) Plant Physiol. 108: 495-502; Peng, J. and 
Harber, N. P. (1993) Plant Cell 5: 351-360; Wilson, R. N. and Somerville, 
C. (1 995) Plant Physiol. 1 08: 495-502], gives rise to synergistic phenotypes 
[Ogas et al. (1997) Science 277: 91-94]. Based on these observations, it is 

25 proposed herein that PKL plays a general role in GA signal transduction. It 
is hypothesized in this study that GA promotes transitions from 
differentiation state A to differentiation state B by activating expression of 
genes necessary for state B and by repressing expression of genes 
necessary for state A via a PKL-dependent pathway. This model does not 

30 preclude the possibility that PKL activity may be stimulated by factors other 
than GA. 
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In conclusion, cloning of a gene necessary for repression of 
embryonic identity has lead to the proposition that a GA-modulated 
chromatin remodeling factor mediates a developmental transition in 
Arabidopsis. It is anticipated that further characterization of PKL, and 
identification of proteins that either regulate or are targets of PKL, will shed 
light on the mechanism of GA signal transduction and the role of GA in 
regulating differentiation and development in Arabidopsis. It remains to be 
determined whether CHD proteins in animal systems will play an analogous 
role in hormone-mediated developmental events. 

EXAMPLE 4 

Generation of Mutant PKL by a Dominant Negative Strategy 

It has previously been demonstrated that a point mutation of a 
conserved lysine in the ATPase/helicase domain of SWI/SNF proteins 
generates a dominant negative mutant form of the protein [Chavari et al., 
(1993) Nature 366:170-174). By mutating the analogous mutation in PKL 
(by mutating Lys-304 to an Arg residue), a dominant negative version of 
PKL may be generated. This mutant allele of PKL may be generated by a 
PCR strategy. 

A complementation construct for PKL was generated that includes 
the PKL cDNA flanked by 1 .1 kb of upstream genomic sequence (to the 
BstBI site) and 1 .4 kb of downstream genomic sequence (to the Ncol site). 
The construct was generated by performing overlap PCR on PKL cDNA 
with three DNA fragments: the genomic fragment upstream of the PKL 
start codon to the BstBI site, the PKL cDNA and the genomic fragment 
downstream of the termination codon to the Ncol site. A BstBI - Xhol 
fragment (2.1 kb) from this construct has been subcloned into a modified 
pBluescript vector (pJ0674). The modified pBluescript vector pJ0674 was 
formed by ligating in a cassette' generated by annealing the primers 
JOpr386 (5'-CTTCGAACTCGAGGGATCCCCATGGCTAGCAGCT-3'), 
shown in SEQ ID NO:26 (this is a synthetic sequence that includes "A" 
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followed by the recognition sequence of BstB1 , Xhol, Bam HI, Ncol, Nhe I 
and sequence "AGCT wherein the last "G" in the Ncol recognition 
sequence and the first "G" in the Nhel recognition sequence overlap) and 
JOpr387(5'-GCTAGCCATGGGGATCCCTCGAGTTCGAAGGTAC), as 
shown in SEQ ID NO:27 (this is a synthetic sequence complementary to 
SEQ ID NO:26) after pBluescript was cut with Kpnl and Sacl. The 
resulting cassette include the following restriction sites: BstB1 , Xhol, Bam 
HI, Ncol and Nhel. 2 separate PCR reactions have been performed using 
this vector as a substrate. 1 PCR reaction uses a T3 primer with the 
following primer shown in SEQ ID NO:1 1 (JOpr516) 5'- 
GAAATGGGACTAGGCAGGACAATTCAAAGC-3' (nucleotides 895-924 in 
SEQ ID NO:1) where the underlined G is designed to replace an A residue 
in the wild-type PKL sequence and introduce the Lys-304 to Arg-304 
mutation. This reaction generates a 272 bp fragment. The other PCR 
reaction uses a T7 primer with the following primer shown in SEQ ID NO: 12 
(JOpr517) 5'-GCTTTGAATTGTCCTGCCTAGTCCCATTTC-3' (sequence 
complementary to SEQ ID NO:1 from nucleotides 924-895) where the 
underlined C is designed to replace a T residue in the wild-type PKL 
sequence and introduce the Lys-304 to Arg-304 mutation. This reaction 
generates a 2094 bp fragment. Overlap PCR can then be done by adding 
the 272 bp and 2094 bp fragment together along with the T3 and T7 
primers generating a 2.3 kb fragment. This fragment will be digested with 
BstBI and Xhol, cloned back into pJ0674 and then sequenced to verify 
introduction of the mutation. This vector will then be cut with BstBI and 
Xhol and ligated into a pBluescript-based vector carrying the 
complementation construct (pJ0765, formed by ligating the 
complementation fragment into pJ0674 cut with BstBI and Ncol) cut with 
BstBI and Xhol, resulting in generation of a complementation construct that 
carries the dominant negative mutation. This construct will then be 
transferred to a binary vector [a modified pCAMBIA3300, pJO630, which is 
formed by digesting pCAMBIA3300 with BstXI and EcoRI and ligating in the 
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cassette generated by annealing primers JOpr232 (5'-CCAGGTACCTGG- 
3'), shown in SEQ ID NO:28 and JOpr233 (5'- 
AATTCCAGGTACCTGGCATG-3'), shown in SEQ ID NO:29] and 
transformed into wild-type plants to verify generation of a mutant pkl 
phenotype. These sequences are synthetic sequences that anneal to form 
a cassette that has ends that are compatible to BstXI and EcoRI digested 
pCAMBIA3300. The entire sequence of JOpr232 is a new site that when 
cut with BstXI generates ends that are compatible with Kpnl ends. The 
cassette thus recreates a BstXI site with Kpnl compatible ends. The PCR 
reactions and subcloning are performed as known in the art, and as 
described, for example, in Sambrook et al. (Eds.), Molecular Cloning: A 
Laboratory Manual 2nd ed., Cold Spring Harbor Laboratory Press (1989). 

A conditional version of this dominant negative allele may be made 
by fusing the gene to the glucocorticoid receptor [Lloyd et al., (1994) 
Science 266:436-439). A clone of the rat glucocorticoid receptor (GR) was 
obtained from Alan Lloyd, at the University of Texas, Austin, Texas. The 
clone included SEQ ID NO:30 (5'- 

TCTAGAGGATCCTGAAGCTCGAAAAACAAAGAAAAAAA-3'), that is 
fused to nucleotides 1569-2407 of rat glucocorticoid receptor cDNA found 
in Genbank Accession No. Y12264. SEQ ID NO:30 was used to add 
spacers and restriction sites to the clone. A PCR reaction has been 
performed with this GR clone as a substrate and the following primers: 
JOpr533 (5'- 

AAGCCAAAGAACATGGTCGTTGATCTAGAGGATCCTGAAGCTCGAAA- 
3') shown in SEQ ID NO:13 (the first 24 nucleotides are nucleotides 4129- 
4152 of SEQ ID NO:1 whereas the last 23 nucleotides are nucleotides 2-24 
of SEQ ID NO:30 of the rat glucocorticoid receptor cDNA found in Genbank 
Accession No. Y12264) and JOpr534 (5'- 

GAATCTTGATTTACCAGTTGAGTCATTTTTGATGAAACAGAAGCTTTTT 
GAT-3') (the first 25 nucleotides are nucleotides complementary to 
nucleotides 4153-4177 of SEQ ID NO:1 and the last 27 nucleotides are 
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complementary to nucleotides 2407-2381 of the glucocorticoid receptor 
cDNA found in Genbank Accession No. Y12264) shown in SEQ ID NO: 14, 
which are designed to add PKL sequences to the end of the GR fragment 
such that overlap PCR can be performed. A BamHI -Ncol fragment of the 
complementation construct has been subcloned into pJ0674, generating 
vector pJ0724. pJ0724 may be the substrate for 2 PCR reactions. One 
reaction can use the T7 primer and JOpr398 (5'- 
ATCAACGACCATGTTCTTTGG-3') (sequence complementary to 
nucleotides 4152-4132 of SEQ ID NO:1), shown in SEQ ID NO:15, 
generating a 883 bp fragment. The other reaction will use the T3 primer 
and JOpr401 (5'- TGACTCAACTGGTAAATCAAGA-3') (nucleotides 4153- 
4174 of SEQ ID NO:1), shown in SEQ ID NO:16, generating a 1.5 kb 
fragment. Overlap PCR can then be performed using 883 bp fragment and 
the GR fragment with the T7 primer and JOpr534. Overlap PCR can then 
be performed again using the product of this PCR reaction and the 1 .5 kb 
fragment using the T7 primer and the T3 primer. This PCR product can 
then be digested with BamHI and Ncol and cloned back into pJ0674 
digested with the same. The construct will then be sequenced to verify 
identity. This construct can then be digested with BamHI and Ncol and 
ligated to the dominant-negative version of the complementation construct 
to generate a C-terminal fusion of GR to the mutant PKL protein. Once 
again, this construct can be transferred to a binary vector (pJO630) and 
transformed into wild-type plants to verify that a mutant pkl phenotype will 
be generated upon addition of dexamethasone. 

If necessary, the dominant-negative version of the gene may be 
overexpressed in order to generate a phenotype. In this case, the mutated 
ORF (+/- GR) can be cloned downstream of a constitutive high level 
promoter such as the 35S promoter in a binary vector. 

In all of Examples 4-6 described herein, ribonuclease protection 
assays will be performed to verify expression of the mutant transcript. The 
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pW phenotype will be assayed by penetrance of the pickle root phenotype 
and by the rosette phenotype [Ogas, J. et al. (1997) Science 277:91-94]. 



Two constructs for inhibiting expression of endogenous PKL by 
iRNA may be generated. These constructs are based on sequence 
comparison between PKL and PKR2, which is another CHD protein that 
exhibits high sequence similarity to PKL. A fragment of PKL may be cloned 
into the vector pRNA69, which results in formation of the following 
construct: 35S promoter - PKL frag in sense orientation - intron - the 
same PKL frag in antisense orientation - terminator. Vector pRNA69 is a 
bacterial vector that was obtained from John Bowman at UC Davis. 

The sequence of the PKL cDNA that is being targeted in the first 
construct is from nucleotide 2 to nucleotide 361 in SEQ ID NO:1 . This 
fragment was generated by performing PCR on PKL cDNA with the 
following primers: JOpr442 (5'- 

CCGCTCGAGTGAGTAGTTTGGTGGAGAGGC-3') found in SEQ ID 
NO: 17 (the first 3 nucleotides are used as spacers so the restriction 
enzyme will cut properly, the next 6 nucleotides represent the Xhol 
recognition sequence and the last 21 nucleotides are nucleotides 2-22 of 
SEQ ID NO:1) and JOpr443 (5'- 

CCGGAATTCCATCGGAGGAACCTTGTTCAC-3'), found in SEQ ID NO:18 
(the first 3 nucleotides are used as spacers so the restriction enzyme will 
cut properly, the next 6 nucleotides represent the Eco Rl recognition 
sequence whereas the last 21 nucleotides are complementary to 
nucleotides 361-341 of SEQ ID NO:1), for the cloning the sense orientation 
(as a Xhol-EcoRI fragment) and JOpr444 (5'- 

CGCGGATCCCATCGGAGGAACCTTGTTCAC-3'), shown in SEQ ID 
NO: 19 (the first 3 nucleotides are used as spacers so the restriction 
enzyme will cut properly, the next 6 nucleotides represent the Bam HI 
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recognition sequence and the last 21 nucleotides are complementary to 
nucleotides 361-341 of SEQ ID NO:1) and JOpr445 (5'- 
TGCTCTAGATGAGTAGTTTGGTGGAGAGGC-3'), shown in SEQ ID 
NO:20 (the first 3 nucleotides are used as spacers so the restriction 
enzyme will cut properly, the next 6 nucleotides represent the Xbal 
recognition sequence and the last 21 nucleotides are nucleotides 2-22 of 
SEQ ID NO:1), for cloning the antisense orientation (as a BamHI-Xbal 
fragment) into pRNA69. 

The sequence of the PKL cDNA that is being targeted in the second 
construct is from nucleotide 3330 to nucleotide 3710 in SEQ ID NO:1 . 
This fragment was generated by performing PCR on PKL cDNA with the 
following primers: JOpr446 (5'- 

CCGCTCGAGCCCTCACATAAGTTTGTCTGC-3'), shown in SEQ ID 
NO:21 (the first 3 nucleotides are used as spacers so the restriction 
enzyme will cut properly, the next 6 nucleotides represent the Xhol 
recognition sequence and the last 21 nucleotides are nucleotides 3330- 
3349 of SEQ ID NO:1), and JOpr447 (5*- 

CCGGAATTCGTCTTAGGAAGTCCATCAAGC-3'), shown in SEQ ID 
NO:22 (the first 3 nucleotides are used as spacers so the restriction 
enzyme will cut properly, the next 6 nucleotides represent the Eco Rl 
recognition sequence and the last 21 nucleotides are complementary to 
nucleotides 3710-3690 of SEQ ID NO:1), for the cloning the sense 
orientation (as a Xhol-EcoRI fragment) and JOpr448 (5'- 
CGCGGATCCGTCTTAGGAAGTCCATCAAGC-3'), found in SEQ ID 
NO:23 (the first 3 nucleotides are used as spacers so the restriction 
enzyme will cut properly, the next 6 nucleotides represent the Bam HI 
recognition sequence whereas the last 21 bases are nucleotides 3330- 
3351 in SEQ ID NO:1), and JOpr449 (5'- 

TGCTCTAGACCCTCACATAAGTTTGTCTGC-3'), shown in SEQ ID NO:24 
(the first 3 nucleotides are used as spacers so the restriction enzyme will 
cut properly, the next 6 nucleotides represent the Xbal recognition 
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sequence and the last 21 nucleotides are nucleotides 3330-3350 in SEQ ID 
NO:1)for cloning the antisense orientation (as a BamHI-Xbal fragment) into 
pRNA69. 

The pRNA69 constructs may then be ligated into the binary vector 
pBART by making use of the flanking Notl sites. Wild-type plants may then 
be transformed by these constructs by vacuum infiltration. The plants may 
then be screened for a mutant pkl phenotype as described for Example 5. 

EXAMPLE 6 
Generation of Mutant PKL by Domain Deletion 

It has been shown that removing the DNA-binding portion of CHD1 
in S. cerevisiae generates an inactive form of the protein [Woodage et al., 
(1997) PNAS 94:1 1472-1 1477). By specifically deleting the DNA-binding 
domain (aa 1069 - 1095) or any of the other domains, a dominant negative 
version of PKL may be produced. The Xhol-BamHI fragment of the PKL 
cDNA sequence has been cloned into pJ0687, a vector obtained by 
introducing this fragment into a pJ0674 vector formed as described in 
Example 4. In order to delete the putative DNA binding domain of PKL, 
PCR mutagenesis may be used. Briefly, a PCR reaction may be performed 
using pJ0687 as a substrate and T7 and the oligo 5'- 
CGCGGATCCI I I I I CCACTTCTCAGTCCGGG-3', shown in SEQ ID 
NO:25 (the first 3 nucleotides are used as spacers so the restriction 
enzyme will cut properly, the next 6 nucleotides represent the Bam HI 
recognition sequence and the last 21 nucleotides are complementary to 
nucleotides 3202-3181 of SEQ ID NO:1), as a primer. The product can be 
digested with Xhol and BamHI and cloned into pJ0674 cut with the same, 
and then can be sequenced to verify introduction of the mutation. This 
vector can then be cut with Xhol and BamHI and ligated into a pBluescript- 
based vector, carrying the complementation construct (pJ0765) cut with 
the same, resulting in generation of a complementation construct that 
carries PKL deleted for the DNA binding domain. This construct can then 
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be transferred to a binary vector (a modified pCAMBIA3300, pJO630) 
formed as described in Example 4. Wild-type plants may then be 
transformed by methods described above with the vector to verify 
generation of a mutant pkl phenotype. 

If necessary, the domain-deleted version of the gene can be 
overexpressed in order to generate a phenotype. If overexpression is 
desired, the mutated ORF can be cloned downstream of a constitutive high 
level promoter, such as the 35S promoter, in a binary vector. 

While the invention has been illustrated and described in detail in the 
drawings and foregoing description, the same is to be considered as 
illustrative and not restrictive in character, it being understood that only the 
preferred embodiment has been shown and described and that all changes 
and modifications that come within the spirit of the invention are desired to 
be protected. In addition, all references cited herein are indicative of the 
level of skill in the art and are hereby incorporated by reference in their 
entirety. 
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CLAIMS 

What is claimed is: 

1. A method of transforming a host cell, comprising introducing 
5 into a host cell a nucleic acid molecule encoding a protein having at least 
one chromo domain, a helicase domain and a DNA binding domain, said 
protein expressed in an amount sufficient to regulate developmental 
identity. 

10 2. The method of claim 1 , wherein said nucleic acid molecule 

further encodes a protein having at least one zinc finger domain. 

3. The method of claim 2, wherein said nucleic acid molecule 
further encodes a second chromo domain. 

15 

4. The method of claim 1, wherein said chromo domain is 
encoded by a nucleic acid molecule having a nucleotide sequence having 
at least about 50% identity to the nucleotide sequence set forth in SEQ ID 
NO:1 from nucleotide 343 to nucleotide 453, said helicase domain is 

20 encoded by a nucleic acid molecule having a nucleotide sequence having 
at least about 50% identity to the nucleotide sequence set forth in SEQ ID 
NO:1 from nucleotide 877 to nucleotide 2217, and said DNA binding 
domain is encoded by a nucleic acid molecule having a nucleotide 
sequence having at least about 50% identity to the nucleotide sequence set 

25 forth in SEQ ID NO:1 from nucleotide 3205 to nucleotide 3285. 

5. The method of claim 2, wherein said zinc finger domain is 
encoded by a nucleic acid molecule having a nucleotide sequence having 
at least about 50% identity to the nucleotide sequence set forth in SEQ ID 

30 NO: 1 from nucleotide 1 45 to nucleotide 288. 
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6. 



The method of claim 3, wherein said second chromo domain 



is encoded by a nucleic acid molecule having a nucleotide sequence having 
5 at least about 50% identity to the nucleotide sequence set forth in SEQ ID 



7. The method of claim 1, wherein said nucleic acid molecule 
has a nucleotide sequence encoding protein domains selected from the 

10 group consisting of a chromo domain having an amino acid sequence 
having at least about 50% identity to the amino acid sequence set forth in 
SEQ ID NO:2 from amino acid 115 to amino acid 151, a helicase domain 
having an amino acid sequence having at least about 50% identity to the 
amino acid sequence set forth in SEQ ID NO:2 from amino acid 293 to 

15 amino acid 739 and a DNA binding domain having an amino acid sequence 
having at least about 50% identity to the amino acid sequence set forth in 
SEQ ID NO:2 from amino acid 1069 to amino acid 1095. 

8. The method of claim 2, wherein said nucleic acid molecule 
20 has a nucleotide sequence encoding said zinc finger domain having an 

amino acid sequence having at least about 50% identity to the amino acid 
sequence set forth in SEQ ID NO:2 from amino acid 49 to amino acid 96. 

9. The method of claim 3, wherein said nucleic acid molecule 
25 has a nucleotide sequence encoding said second chromo domain having an 

amino acid sequence having at least about 50% identity to the amino acid 
sequence set forth in SEQ ID NO:2 from amino acid 191 to amino acid 227. 



NO:1 from nucleotide 571 to nucleotide 681. 



1 0. The method of claim 1 , wherein said host cell is a eukaryotic 



30 



cell. 



1 1 . The method of claim 10, wherein said eukaryotic cell is a plant 



cell. 
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12. The method of claim 11, wherein said eukaryotic cell is 
an animal cell. 

5 

13. The method of claim 12, wherein said animal cell is a 
mammalian cell. 

14. The method of claim 13, wherein said mammalian cell is a 
10 human cell. 

15. The method of claim 1, further comprising deleting the 
nucleotide sequences encoding any one of said domains prior to said 
introducing. 

15 

16. The method of claim 1, wherein said protein has a point 
mutation in lysine 304 of SEQ ID NO:2. 

17. The method of claim 16, wherein said mutation results in said 
20 lysine being replaced by an arginine. 

1 8. The method of claim 1 , wherein said protein encodes PKL. 

19. The method of claim 18, wherein said PKL has an amino acid 
25 sequence as set forth in SEQ ID NO:2. 

20. The method of claim 1, wherein said nucleic acid molecule 
has a nucleotide sequence having at least about 80% identity to the 
nucleotide sequence set forth in SEQ ID NO:1 . 
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21. The method of claim 1, wherein said nucleic acid molecule 
further comprises a promoter operably linked to a terminal 5' end of said 
nucleotide sequence. 

22. The method of claim 21, wherein said promoter is selected 
from the group consisting of a constitutive promoter, an inducible promoter 
and a cell-specific promoter. 

23. The method of claim 21, wherein said promoter is a foreign 
promoter. 

24. The method of claim 18, wherein said PKL functions in 
repressing embryonic identity in said plant. 

25. The method of claim 1, wherein said nucleic acid molecule 
comprises a nucleotide sequence having substantial similarity to the 
nucleotide sequence set forth in SEQ ID NO:1. 

26. A method of transforming a host cell, comprising introducing 
into a host cell a nucleic acid molecule encoding a protein functioning in 
regulating developmental identity, said protein having an amino acid 
sequence having at least about 50% identity to the amino acid sequence set 
forth in SEQ ID NO:2, said protein functioning in regulating developmental 
identity. 

27. The method of claim 26, wherein said protein has an amino 
acid sequence having at least about 80% identity to the amino acid 
sequence set forth in SEQ ID NO:2. 

28. The method of claim 27, wherein said protein has an amino 
acid sequence as set forth in SEQ ID NO:2. 
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29. The method of claim 26, wherein said nucleic acid molecule 
has a nucleotide sequence as set forth in SEQ ID NO:1. 

5 30. The method of claim 26, wherein said host cell is a eukaryotic 

cell. 

31 . The method of claim 30, wherein said eukaryotic cell is a 
plant cell. 

10 

32. The method of claim 30, wherein said eukaryotic cell is an 
animal cell. 

33. A method of transforming a host cell, comprising introducing 
15 into a host cell a nucleic acid molecule encoding a protein functioning in 

regulating developmental identity, said nucleic acid molecule having a 
nucleotide sequence having at least about 50% identity to the nucleotide 
sequence set forth in SEQ ID NO:1 from nucleotide 1 to nucleotide 4152. 

20 34. The method of claim 33, wherein said protein functions in 

repressing embryonic identity. 

35. The method of claim 33, wherein said nucleic acid molecule 
has a nucleotide sequence having at least about 80% identity to the 

25 nucleotide sequence set forth in SEQ ID NO:1 from nucleotide 1 to 
nucleotide 4152. 

36. The method of claim 35, wherein said nucleic acid molecule 
has a nucleotide sequence as set forth in SEQ ID NO:1 from nucleotide 1 

30 to nucleotide 4152. 
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37. The method of claim 33, wherein said nucleic acid molecule 
further comprises a promoter operably linked to a terminal 5' end of said 
nucleotide sequence. 

5 38. A method of transforming a host cell, comprising: 

(a) introducing into a host cell an antisense DNA or RNA 
molecule comprising a nucleotide sequence complementary to a length of 
nucleotides within a nucleic acid molecule encoding a protein having at 
least one chromo domain, a helicase domain and a DNA binding domain, 

10 said protein functioning in regulating developmental identity; and 

(b) culturing said host cell under conditions effective for 
hybridization of said antisense molecule to nucleic acid of said host to 
regulate developmental identity. 

15 39. The method of claim 38, wherein said protein encodes PKL. 

40. The method of claim 38, wherein said nucleotide sequence is 
about 100 to about 1000 nucleotides in length. 

20 41. The method of claim 38 wherein said nucleotide sequence is 

complementary to a region from about nucleotide 2 to about nucleotide 361 
set forth in SEQ ID NO:1. 

42. The method of claim 38, wherein said nucleotide sequence is 
25 complementary to a region from about nucleotide 3330 to about nucleotide 

3710. 

43. The method of claim 1, wherein said nucleic acid molecule 
further encodes a protein having at least one zinc finger domain. 

30 



WO 01/14519 



PCT/US00/22725 



56 

44. The method of claim 38, wherein said nucleic acid molecule 
further encodes a second chromo domain. 

45. A method of transforming a host cell, comprising: 

5 (a) introducing into a host cell an antisense DNA or RNA 

molecule comprising a nucleotide sequence complementary to a length of 
nucleotides within a first nucleotide sequence having at least about 50% 
identity to the nucleotide sequence set forth in SEQ ID NO:1, said first 
nucleotide sequence encoding a protein functioning in regulating 
10 developmental identity; and 

(b) culturing said host cell under conditions effective for 
hybridization of said antisense nucleotide sequence to nucleic acid of said 
host cell. 

15 46. The method of claim 45, wherein said first nucleotide 

sequence has at least about 80% identity to the nucleotide sequence set 
forth in SEQ ID NO:1. 

47. The method of claim 46, wherein said first nucleotide 
20 sequence is a nucleotide sequence that encodes PKL. 

48. The method of claim 46, wherein said first nucleotide 
sequence is as set forth in SEQ ID NO:1 from nucleotide 1 to nucleotide 
4152. 

25 

49. The method of claim 45, wherein said antisense molecule is 
about 100 to about 1000 nucleotides in length. 

50. The method of claim 45, wherein said nucleotide sequence in 
30 complementary to a region from about nucleotide 2 to about nucleotide 361 

set forth in SEQ ID NO:1. 
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51 • The method of claim 45, wherein said nucleotide sequence is 
complementary to a region from about nucleotide 3330 to about nucleotide 
5 3710 set forth in SEQ ID NO:1. 



52. A method of transforming a host cell, comprising: 

(a) introducing into a host cell a vector comprising a first 
nucleic acid molecule having a nucleotide sequence that is complementary 

10 to a nucleotide sequence having at least about 50% identity to a length of 
nucleotides within the nucleotide sequence set forth in SEQ ID NO:1, said 
nucleotide sequence encoding a protein functioning in regulating 
developmental identity; 

(b) generating an antisense nucleic acid molecule 
> 15 complementary to an RNA transcript formed from SEQ ID NO:1 ; and 

(b) culturing said host cell under conditions effective for 
hybridization of said antisense molecule to said RNA transcript of said host 
cell. 



20 53. The method of claim 52, wherein said nucleic acid molecule 

has a nucleotide sequence that is complementary to a length of nucleotides 
within the nucleotide sequence set forth in SEQ ID NO:1. 

54. The method of claim 52, wherein the antisense nucleic acid 
25 molecule is an RNA molecule. 

55. A recombinant nucleic acid molecule, comprising: 

(a) a nucleotide sequence encoding a protein functioning in 
regulating developmental identity, said protein having at least one chromo 
30 domain, a helicase domain and a DNA binding domain, said protein 
expressible in an amount sufficient to regulate developmental identity; and 
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(b) a foreign promoter operably linked to a terminal 5' end 
of said nucleotide sequence. 

56. The method of claim 55, wherein said protein further has at 
least one zinc finger domain. 

57. The method of claim 55, wherein said protein further has a 
second chromo domain. 



10 



58. A recombinant nucleic acid molecule, comprising: 

(a) a nucleotide sequence encoding a protein functioning in 
15 regulating developmental identity, said protein having an amino acid 

sequence having at least about 50% identity to the amino acid sequence set 
forth in SEQ ID NO:2 ; and 

(b) a foreign promoter operably linked to a terminal 5' end 
of said nucleotide sequence. 

20 

59. The molecule of claim 58, wherein said foreign promoter is 
selected from the group consisting of a constitutive promoter, an inducible 
promoter and a cell-specific promoter. 

25 60. The molecule of claim 58, wherein said protein has an amino 

acid sequence having at least about 70% identity to the amino acid 
sequence set forth in SEQ ID NO:2. 

61 . The molecule of claim 58, wherein said protein has an amino 
30 acid sequence of PKL. 

62. The molecule of claim 61, wherein said protein has an amino 
acid sequence as set forth in SEQ ID NO:2. 
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63. A recombinant nucleic acid molecule, comprising: 

(a) a nucleotide sequence encoding a protein functioning 
in regulating developmental identity, said nucleotide sequence having at 
least about 50% identity to the nucleotide sequence set forth in SEQ ID 

5 NO:1 from nucleotide 1 to nucleotide 4152; and 

(b) a foreign promoter operably linked to a terminal 5' end 
of said nucleotide sequence. 

64. The molecule of claim 63, wherein said foreign promoter is 
10 selected from the group consisting of a constitutive promoter, an inducible 

promoter and a cell-specific promoter. 

65. The molecule of claim 63, wherein said nucleotide sequence 
has at least about 80% identity to the nucleotide sequence set forth in SEQ 

15 ID NO:1 from nucleotide 1 to nucleotide 4152. 

66. The molecule of claim 65, wherein said nucleotide sequence 
is as set forth in SEQ ID NO:1 from nucleotide 1 to nucleotide 4152. 

20 67. An isolated nucleic acid molecule, comprising a nucleotide 

sequence encoding a protein functioning in regulating developmental 
identity, said nucleotide sequence encoding a protein having at least one 
chromo domain, a helicase domain and a DNA binding domain. 

2 5 68. The molecule of claim 67, wherein said nucleic acid molecule 

further encodes a protein having at least one zinc finger domain. 

69. The molecule of claim 68, wherein said nucleic acid molecule 
further encodes a second chromo domain. 
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70. A eukaryotic cell, comprising: 

(a) an introduced nucleic acid molecule having a 
nucleotide sequence encoding a protein functioning in regulating 
developmental identity, said protein having an amino acid sequence having 
at least about 50% identity to the amino acid sequence set forth in SEQ ID 
NO:2. 

71. The cell of claim 70, wherein said protein has an amino acid 
sequence having at least about 50% identity to the amino acid sequence set 
forth in SEQ ID NO:2. 

72. The cell of claim 71, wherein said protein has an amino acid 
sequence as set forth in SEQ ID NO:2. 

73. The cell of claim 70, wherein said cell is a plant cell. 

74. The cell of claim 70, wherein said cell is an animal cell. 

75. A transgenic plant, comprising: 

(a) an introduced nucleic acid molecule having a 
nucleotide sequence encoding a plant protein functioning in regulating 
developmental identity, said protein having an amino acid sequence having 
at least about 50% identity to the amino acid sequence set forth in SEQ ID 
NO:2; and 

(b) a foreign promoter operably linked to a terminal 5' end 
of said nucleotide sequence. 

76. The transgenic plant of claim 75, wherein said nucleotide 
sequence is an antisense DNA or RNA molecule. 
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77. The transgenic plant of claim 75, wherein said protein has an 
amino acid sequence having at least about 80% identity to the amino acid 
sequence set forth in SEQ ID NO:2. 

78. The transgenic plant of claim 77, wherein said protein has the 
amino acid sequence of PKL. 

79. The transgenic plant of claim 78, wherein said amino acid 
sequence is as set forth in SEQ ID NO:2. 

80. A recombinant protein, comprising a protein having an amino 
acid sequence having at least about 50% identity to the amino acid 
sequence set forth in SEQ ID NO:2. 

81. The protein of claim 80, wherein said protein has an amino 
acid sequence having at least about 80% identity to the amino acid 
sequence set forth in SEQ ID NO:2. 

82. The protein of claim 81, wherein said protein has an amino 
acid sequence as set forth in SEQ ID NO:2. 

83. A method of producing a PKL protein, comprising: 

(a) introducing a nucleotide sequence encoding a protein 
having at least about 50% identity to the amino acid sequence set forth in 
SEQ ID NO:2; and 

(b) culturing said host cell under conditions effective to 
achieve expression of the PKL polypeptide. 
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<110> Ogas, Joseph P. 

Somerville, Christopher R. 

<12 0> Methods and Compositions for Regulating Developmental 
Identity 

<130> 7024-473 

<14 0> Unknown 

<141> 2000-08-18 

<150> US 60/149,975 

<151> 1999-08-20 

<160> 30 

<170> MS Notebook 
<210> 1 
<211> 4177 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 

<400> 1 

atg agt agt ttg gtg gag agg ctt cgc ata cga tct gat agg aaa cca 48 

Met Ser Ser Leu Val Glu Arg Leu Arg lie Arg Ser Asp Arg Lys Pro 
1 5 10 15 

gtt tat aac eta gat gat tct gat gat gac gac ttc gtt cct aaa aaa 96 

Val Tyr Asn Leu Asp Asp Ser Asp Asp Asp Asp Phe Val Pro Lys Lys 
20 25 30 

gat cga acc ttt gag caa gtc gag get att gtc aga act gat gcg aaa 144 

Asp Arg Thr Phe Glu Gin Val Glu Ala lie Val Arg Thr Asp Ala Lys 
35 40 45 

gaa aat gca tgt cag get tgt ggg gaa agt act aat ctt gta age tgc 192 
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Glu Asn Ala Cys Gin Ala Cys Gly Glu Ser Thr Asn Leu Val Ser Cys 
50 55 60 

aat aca tgc act tat gcg ttc cat get aaa tgc tta gtt cca cct ctt 240 

Asn Thr Cys Thr Tyr Ala Phe His Ala Lys Cys Leu Val Pro Pro Leu 
65 70 75 80 

aaa gat get tec gtg gaa aat tgg aga tgc cct gaa tgt gtt agt cct 288 

Lys Asp Ala Ser Val Glu Asn Trp Arg Cys Pro Glu Cys Val Ser Pro 

85 90 ~ 95 

ctt aac gag ata gat aag ata ttg gat tgt gaa atg cgt cct aca aaa 33 6 

Leu Asn Glu lie Asp Lys lie Leu Asp Cys Glu Met Arg Pro Thr Lys 
100 105 110 

s. tct agt gaa caa ggt tec tec gat gcg gaa ccg aag cca att ttt gtg 3 84 

Ser Ser Glu Gin Gly Ser Ser Asp Ala Glu Pro Lys Pro lie Phe Val 
115 120 125 

aaa cag tat etc gtg aag tgg aag gga tta tea tac ctt cac tgc tct 432 

Lys Gin Tyr Leu Val Lys Trp Lys Gly Leu Ser Tyr Leu His Cys Ser 
130 135 140 

tgg gtg cct gag aag gag ttc cag aag get tat aag tea aat cat cgt 480 

Trp Val Pro Glu Lys Glu Phe Gin Lys Ala Tyr Lys Ser Asn His Arg 
145 150 155 160 

tta aaa acc aga gtg aac aat ttt cac cgt caa atg gag tea ttc aat 528 

Leu Lys Thr Arg Val Asn Asn Phe His Arg Gin Met Glu Ser Phe Asn 

165 170 175 

aac age gaa gat gat ttt gtt gec ata cgt cct gag tgg acc act gtt 576 

Asn Ser Glu Asp Asp Phe Val Ala lie Arg Pro Glu Trp Thr Thr Val 
180 185 190 

gat egg att ctt gee tgc aga gag gaa gat ggg gag ctg gaa tat ctt 624 

Asp Arg lie Leu Ala Cys Arg Glu Glu Asp Gly Glu Leu Glu Tyr Leu 
195 200 205 

gtc aaa tat aaa gag eta tec tat gat gaa tgt tat tgg gag tea gaa 672 

Val Lys Tyr Lys Glu Leu Ser Tyr Asp Glu Cys Tyr Trp Glu Ser Glu 
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Pklseql -app 
210 215 . 220 

tea gac ate tea acc ttc cag aat gaa att caa agg ttc aag gat gta 72 0 

Ser Asp He Ser Thr Phe Gin Asn Glu He Gin Arg Phe Lys Asp Val 
225 230 235 240 

aat tct aga act cgc aga agt aaa gat gtt gac cat aaa aga aat ccc 768 

Asn Ser Arg Thr Arg Arg Ser Lys Asp Val Asp His Lys Arg Asn Pro 

245 250 255 

aga gac ttt caa cag ttt gat cat act cct gaa ttc etc aaa ggc ttg 816 

Arg Asp Phe Gin Gin Phe Asp His Thr Pro Glu Phe Leu Lys Gly Leu 
260 265 270 

tta cat cca tac cag ctt gag gga ctt aat ttt ttg egg ttc teg tgg 864 

"w Leu His Pro Tyr Gin Leu Glu Gly Leu Asn Phe Leu Arg Phe Ser Trp 
275 280 285 

tea aaa cag acg cat gta ate ctt get gat gaa atg gga eta ggc aag 912 

Ser Lys Gin Thr His Val He Leu Ala Asp Glu Met Gly Leu Gly Lys 

290 295 300 

aca att caa age att gee ctt tta get tea ctt ttt gag gag aac etc 960 

Thr He Gin Ser He Ala Leu Leu Ala Ser Leu Phe Glu Glu Asn Leu 
305 310 315 320 

att ccg cat ttg gta att get cct eta teg act ctg cgt aac tgg gag 1008 

lie Pro His Leu Val He Ala Pro Leu Ser Thr Leu Arg Asn Trp Glu 

325 33 0 335 

aga gag ttt gec aca tgg gec cca cag atg aac gtg gtt atg tat ttt 1056 

Arg Glu Phe Ala Thr Trp Ala Pro Gin Met Asn Val Val Met Tyr Phe 
340 345 350 

ggc act gcg caa get cga gca gtt ate aga gaa cat gag ttt tac tta 1104 

Gly Thr Ala Gin Ala Arg Ala Val He Arg Glu His Glu Phe Tyr Leu 
355 360 365 

teg aaa gat caa aaa aag ate aag aaa aag aaa tct gga caa ata agt 1152 

Ser Lys Asp Gin Lys Lys He Lys Lys Lys Lys Ser Gly Gin He Ser 
370 375 380 
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age gaa age aag caa aaa aga ate aag ttt gat gtc etc etc aca teg 1200 

Ser Glu Ser Lys Gin Lys Arg He Lys Phe Asp Val Leu Leu Thr Ser 
385 390 395 400 

tat gag atg ate aac eta gat tea gca gtt eta aaa cca att aag tgg 1248 

Tyr Glu Met He Asn Leu Asp Ser Ala Val Leu Lys Pro He Lys Trp 

405 410 415 

gag tgc atg att gtt gat gaa ggt cat cga ctg aaa aat aag gat tea 12 9 6 

Glu Cys Met lie Val Asp Glu Gly His Arg Leu Lys Asn Lys Asp Ser 
420 425 430 

aag ctg ttc tct tea ttg aca cag tat tea agt aac cac cgt att ctt 1344 

— , Lys Leu Phe Ser Ser Leu Thr Gin Tyr Ser Ser Asn His Arg He Leu 
W 435 440 445 

ctg aca gga aca cca ctt cag aac aac ttg gat gaa ctt ttc atg etc 1392 

Leu Thr Gly Thr Pro Leu Gin Asn Asn Leu Asp Glu Leu Phe Met Leu 
450 455 460 

atg cat ttt ctt gat gcg ggg aag ttt gga agt ttg gag gag ttc cag 1440 

Met His Phe Leu Asp Ala Gly Lys Phe Gly Ser Leu Glu Glu Phe Gin 
465 470 475 480 

gag gag ttc aaa gat att aat caa gag gag cag ate tea agg ttg cac 1488 

Glu Glu Phe Lys Asp He Asn Gin Glu Glu Gin He Ser Arg Leu His 

485 490 495 

aaa atg ttg get cca cat ttg etc aga agg gta aaa aaa gac gta atg 153 6 

Lys Met Leu Ala Pro His Leu Leu Arg Arg Val Lys Lys Asp Val Met 
500 505 510 

aaa gac atg ccc ccc aaa aag gag etc att ttg cgt gtt gat ctg age 1584 

Lys Asp Met Pro Pro Lys Lys Glu Leu He Leu Arg Val Asp Leu Ser 
515 520 525 

agt ctg cag aaa gaa tat tac aaa get att ttt acc cgt aat tat caa 1632 

Ser Leu Gin Lys Glu Tyr Tyr Lys Ala He Phe Thr Arg Asn Tyr Gin 
530 535 540 
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gta ttg aca aaa aag gga ggt get caa att tec ctt aat aac att atg 1680 

Val Leu Thr Lys Lys Gly Gly Ala Gin lie Ser Leu Asn Asn lie Met 
545 550 555 560 

atg gaa tta cga aaa gta tgc tgc cat cct tat atg eta gag ggt gtt 1728 

Met Glu Leu Arg Lys Val Cys Cys His Pro Tyr Met Leu Glu Gly Val 

565 570 575 

gag cca gtt att cac gac gca aat gaa get ttc aaa caa ctt ttg gag 177 6 

Glu Pro Val lie His Asp Ala Asn Glu Ala Phe Lys Gin Leu Leu Glu 
580 585 590 

tct tgt gga aag ctg caa ctt eta gat aaa atg atg gtc aaa ctg aaa 1824 

Ser Cys Gly Lys Leu Gin Leu Leu Asp Lys Met Met Val Lys Leu Lys 
595 600 605 

gag caa gga cac aga gtc eta ata tac aca cag ttt cag cat atg ctg 1872 

Glu Gin Gly His Arg Val Leu He Tyr Thr Gin Phe Gin His Met Leu 
610 615 620 

gac tta ctt gaa gac tac tgt ace cat aag aaa tgg cag tac gag cga 1920 

Asp Leu Leu Glu Asp Tyr Cys Thr His Lys Lys Trp Gin Tyr Glu Arg 
625 630 635 " 640 

att gat gga aag gtt ggc gga get gag egg caa ata cgc ata gat egg 1968 

He Asp Gly Lys Val Gly Gly Ala Glu Arg Gin He Arg lie Asp Arg 

645 650 " 655 

ttc aat gee aaa aat tct aac aag ttt tgt ttt ttg etc tec aca aga 2016 

Phe Asn Ala Lys Asn Ser Asn Lys Phe Cys Phe Leu Leu Ser Thr Arg 
660 665 670 

get ggt ggc tta gga ata aat ctt gca acg get gat aca gta ate att 2064 

Ala Gly Gly Leu Gly He Asn Leu Ala Thr Ala Asp Thr Val He He 
675 680 685 

tat gac agt gac tgg aat cct cat get gat ctt caa gca atg get aga 2112 

Tyr Asp Ser Asp Trp Asn Pro His Ala Asp Leu Gin Ala Met Ala Arg 
690 695 700 

get cat cga ctt ggc caa aca aat aag gtg atg att tat agg etc ata 2160 
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Ala His Arg Leu Gly Gin Thr Asn Lys Val Met lie Tyr Arg Leu lie 
705 710 715 " 720 

aac cga ggc acc att gaa gaa agg atg atg caa ttg act aaa aag aaa 22 08 

Asn Arg Gly Thr He Glu Glu Arg Met Met Gin Leu Thr Lys Lys Lys 

725 730 " 735 

atg gtt eta gag cat ctt gtt gtt ggg aaa etc aaa aca caa aac att 2256 

Met Val Leu Glu His Leu Val Val Gly Lys Leu Lys Thr Gin Asn He 
740 745 750 

aat cag gaa gag tta gat gac ate ate agg tat gga tea aag gag ctt 23 04 

Asn Gin Glu Glu Leu Asp Asp He He Arg Tyr Gly Ser Lys Glu Leu 
755 760 765 



ttt get agt gaa gat gat gaa gca gga aag tct gga aaa att 



cat tat 2352 



Phe Ala Ser Glu Asp Asp Glu Ala Gly Lys Ser Gly Lys He His Tyr 
770 775 780 

gat gat gcg get ata gac aaa ttg ctt gat cgt gat etc gtg gag gca 2400 

Asp Asp Ala Ala He Asp Lys Leu Leu Asp Arg Asp Leu Val Glu Ala 
785 790 795 * 800 

gag gaa gtc tea gtg gat gat gaa gag gag aat gga ttc tta aag get 2 44 8 

Glu Glu Val Ser Val Asp Asp Glu Glu Glu Asn Gly Phe Leu Lys Ala 

80S 810 " 815 

ttc aag gtg get aat ttt gaa tat ata gat gaa aat gag gca gca gca 2496 

Phe Lys Val Ala Asn Phe Glu Tyr He Asp Glu Asn Glu Ala Ala Ala 
820 825 830 

tta gag gca cag aga gtc get get gaa age aaa tct tea gca ggc aat 2 544 

Leu Glu Ala Gin Arg Val Ala Ala Glu Ser Lys Ser Ser Ala Gly Asn 
835 840 845 

tct gat aga gca agt tat tgg gaa gag ttg tta aaa gat aaa ttt gag 2 592 

Ser Asp Arg Ala Ser Tyr Trp Glu Glu Leu Leu Lys Asp Lys Phe Glu 
850 855 860 

ctg cac cag get gag gag ctt aat get ctt gga aaa agg aag aga agt 2 640 
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Leu His Gin Ala Glu Glu Leu Asn Ala Leu Gly Lys Arg Lys Arg Ser 
865 870 875 " 880 

cgc aag cag ttg gta tec att gaa gaa gat gat ctt get ggt ttg gaa 2 688 

Arg Lys Gin Leu Val Ser lie Glu Glu Asp Asp Leu Ala Gly Leu Glu 

885 890 895 

gat gtg age tct gat gga gat gaa agt tat gaa get gag tea aca gat 273 6 

Asp Val Ser Ser Asp Gly Asp Glu Ser Tyr Glu Ala Glu Ser Thr Asp 
900 905 910 

ggt gaa gca gca gga caa gga gtt cag acg ggt cga egg ccg tac aga 2784 

Gly Glu Ala Ala Gly Gin Gly Val Gin Thr Gly Arg Arg Pro Tyr Arg 
915 920 925 

-..aga aag ggt cgc gat aat ttg gaa cca act ccg ttg atg gaa ggt gag 2832 

Arg Lys Gly Arg Asp Asn Leu Glu Pro Thr Pro Leu Met Glu Gly Glu 
930 935 940 

ggg aga tct ttc aga gta ctg ggt ttc aac cag agt caa agg gec att 2880 

Gly Arg Ser Phe Arg Val Leu Gly Phe Asn Gin Ser Gin Arg Ala lie 
945 950 955 960 

ttt gta cag act ttg atg agg tat gga get ggc aat ttt gat tgg aag 2928 

Phe Val Gin Thr Leu Met Arg Tyr Gly Ala Gly Asn Phe Asp Trp Lys 

965 970 975 

gag ttt gtt cct cgc tta aag cag aag acc ttt gaa gaa ata aat gaa 2976 

Glu Phe Val Pro Arg Leu Lys Gin Lys Thr Phe Glu Glu lie Asn Glu 
~" 980 985 990 

tat gga ata etc ttc ttg aag cac att get gaa gaa ata gac gag aat 3024 

Tyr Gly lie Leu Phe Leu Lys His lie Ala Glu Glu lie Asp Glu Asn 
995 1000 1005 

tct cca acc ttt tea gat ggt gtg ccc aag gaa gga ctt aga ata gaa 3072 

Ser Pro Thr Phe Ser Asp Gly Val Pro Lys Glu Gly Leu Arg lie Glu 
1010 1015 1020 

gat gtt eta gtc aga att get ctt ctg ata eta gtt cag gag aag gtg 3120 

Asp Val Leu Val Arg lie Ala Leu Leu lie Leu Val Gin Glu Lys Val 
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1025 1030 1035 1040 

aaa ttt gta gaa gat cat cca ggg aaa cct gtt ttc ccc tct cgc att 3168 

Lys Phe Val Glu Asp His Pro Gly Lys Pro Val Phe Pro Ser Arg lie 

1045 1050 1055 

ctt gaa aga ttc ccc gga ctg aga agt gga aaa att tgg aag gag gaa 3 216 

Leu Glu Arg Phe Pro Gly Leu Arg Ser Gly Lys lie Trp Lys Glu Glu 
1060 1065 1070 

cat gac aag ata atg ata cgt get gtt tta aag cat ggg tac gga egg 3264 

His Asp Lys lie Met He Arg Ala Val Leu Lys His Gly Tyr Gly Arg 
1075 1080 1085 

tgg caa get att gtt gat gac aaa gag ttg ggg ate caa gag ctt ate 3 312 

Trp Gin Ala He Val Asp Asp Lys Glu Leu Gly He Gin Glu Leu lie 
1090 1095 1100 

tgc aaa gaa ttg aat ttc cct cac ata agt ttg tct get get gaa caa 3360 

Cys Lys Glu Leu Asn Phe Pro His He Ser Leu Ser Ala Ala Glu Gin 
1105 1110 1115 1120 

get ggt ttg cag ggg cag aat ggt agt ggg ggc tct aat ccg gga gca 3408 

Ala Gly Leu Gin Gly Gin Asn Gly Ser Gly Gly Ser Asn Pro Gly Ala 

1125 1130 1135 

cag act aac cag aat cct gga age gtt att act ggg aac aat aat get 3 456 

.... Gin Thr Asn Gin Asn Pro Gly Ser Val He Thr Gly Asn Asn Asn Ala 
1140 1145 1150 

tct get gat ggg get caa gta aac teg atg ttc tat tat egg gac atg 3 504 

Ser Ala Asp Gly Ala Gin Val Asn Ser Met Phe Tyr Tyr Arg Asp Met 
1155 1160 1165 

cag aga cga ctt gtt gag ttt gtg aaa aag cga gtt ctg ctt ttg gag 3552 

Gin Arg Arg Leu Val Glu Phe Val Lys Lys Arg Val Leu Leu Leu Glu 
1170 1175 1180 

aag gcg atg aat tat gaa tac gca gag gaa tat tat gga ctt ggt ggc 3 600 

Lys Ala Met Asn Tyr Glu Tyr Ala Glu Glu Tyr Tyr Gly Leu Gly Gly 
1185 1190 1195 1200 
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tea tea tct ate cct act gaa gaa cca gaa get gaa cca aag ate get 3 648 

Ser Ser Ser lie Pro Thr Glu Glu Pro Glu Ala Glu Pro Lys lie Ala 

1205 1210 1215 

gac aca gtg gga gtg age ttt att gag gtt gat gat gaa atg ctt gat 3696 

Asp Thr Val Gly Val Ser Phe lie Glu Val Asp Asp Glu Met Leu Asp 
1220 1225 ~ 1230 

gga ctt cct aag act gat cct ate act tea gaa gaa att atg ggg get 3744 

Gly Leu Pro Lys Thr Asp Pro lie Thr Ser Glu Glu lie Met Gly Ala 
1235 1240 1245 

get gtt gac aac aac caa gcg egg gtc gaa ata get caa cat tat aac 3792 

"N, Ala Val Asp Asn Asn Gin Ala Arg Val Glu lie Ala Gin His Tyr Asn 
1250 1255 1260 

cag atg tgc aaa ctt ctt gat gag aac get egg gaa tea gtc caa gca 3 840 

Gin Met Cys Lys Leu Leu Asp Glu Asn Ala Arg Glu Ser Val Gin Ala 
1265 1270 1275 1280 

tat gta aac aac caa cca ccg agt ace aag gtg aat gag age ttc cgt 3888 

Tyr Val Asn Asn Gin Pro Pro Ser Thr Lys Val Asn Glu Ser Phe Arg 

1285 1290 1295 

gca etc aaa tct ate aat ggt aac att aac aca ate ctt teg att aca 393 6 

Ala Leu Lys Ser lie Asn Gly Asn lie Asn Thr lie Leu Ser lie Thr 
1300 1305 1310 

tct gat caa tec aag tea cat gaa gac gac ace aag cca gac eta aac 3984 

Ser Asp Gin Ser Lys Ser His Glu Asp Asp Thr Lys Pro Asp Leu Asn 
1315 1320 1325 

aat gtt gag atg aag gac acg gee gaa gaa aca aaa ccg tta aga ggt 4032 

Asn Val Glu Met Lys Asp Thr Ala Glu Glu Thr Lys Pro Leu Arg Gly 
1330 1335 1340 

ggc gtc gtc gat ctg aat gtg gtg gag gga gag gag aac att get gaa 4080 

Gly Val Val Asp Leu Asn Val Val Glu Gly Glu Glu Asn lie Ala Glu 
1345 1350 1355 1360 
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get agt gga agt gtt gat gta aaa atg gaa gaa gec aaa gaa gaa gag 412 8 

Ala Ser Gly Ser Val Asp Val Lys Met Glu Glu Ala Lys Glu Glu Glu 

1365 1370 1375 

aag cca aag aac atg gtc gtt gat tgactcaact ggtaaatcaa gattc 4177 

Lys Pro Lys Asn Met Val Val Asp 
1380 



<210> 2 

<211> 1384 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 2 

Met Ser Ser Leu Val Glu Arg Leu Arg lie Arg Ser Asp Arg Lys Pro 
1 5 10 15 

Val Tyr Asn Leu Asp Asp Ser Asp Asp Asp Asp Phe Val Pro Lys Lys 
20 25 30 

Asp Arg Thr Phe Glu Gin Val Glu Ala lie Val Arg Thr Asp Ala Lys 
3 5 40 45 

Glu Asn Ala Cys Gin Ala Cys Gly Glu Ser Thr Asn Leu Val Ser Cys 
50 55 60 

Asn Thr Cys Thr Tyr Ala Phe His Ala Lys Cys Leu Val Pro Pro Leu 
65 70 75 80 

Lys Asp Ala Ser Val Glu Asn Trp Arg Cys Pro Glu Cys Val Ser Pro 

85 90 9 5 

Leu Asn Glu lie Asp Lys lie Leu Asp Cys Glu Met Arg Pro Thr Lys 
100 105 110 

Ser Ser Glu Gin Gly Ser Ser Asp Ala Glu Pro Lys Pro lie Phe Val 
115 120 125 
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Lys Gin Tyr Leu Val Lys Trp Lys Gly Leu Ser Tyr Leu His Cys Ser 
130 135 140 

Trp Val Pro Glu Lys Glu Phe Gin Lys Ala Tyr Lys Ser Asn His Arg 
145 150 155 160 

Leu Lys Thr Arg Val Asn Asn Phe His Arg Gin Met Glu Ser Phe Asn 

165 170 175 

Asn Ser Glu Asp Asp Phe Val Ala lie Arg Pro Glu Trp Thr Thr Val 
180 185 190 

Asp Arg lie Leu Ala Cys Arg Glu Glu Asp Gly Glu Leu Glu Tyr Leu 
195 200 205 

J 

Val Lys Tyr Lys Glu Leu Ser Tyr Asp Glu Cys Tyr Trp Glu Ser Glu 
210 215 220 

Ser Asp lie Ser Thr Phe Gin Asn Glu lie Gin Arg Phe Lys Asp Val 
225 230 235 240 

Asn Ser Arg Thr Arg Arg Ser Lys Asp Val Asp His Lys Arg Asn Pro 

245 250 255 

Arg Asp Phe Gin Gin Phe Asp His Thr Pro Glu Phe Leu Lys Gly Leu 
260 265 270 

Leu His Pro Tyr Gin Leu Glu Gly Leu Asn Phe Leu Arg Phe Ser Trp 
275 280 285 

Ser Lys Gin Thr His Val lie Leu Ala Asp Glu Met Gly Leu Glv Lvs 
290 295 300 

Thr lie Gin Ser lie Ala Leu Leu Ala Ser Leu Phe Glu Glu Asn Leu 
3 °5 310 315 320 

lie Pro His Leu Val lie Ala Pro Leu Ser Thr Leu Arg Asn Trp Glu 

325 330 335 
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Arg Glu Phe Ala Thr Trp Ala Pro Gin Met Asn Val Val Met Tyr Phe 
340 345 350 

Gly Thr Ala Gin Ala Arg Ala Val lie Arg Glu His Glu Phe Tyr Leu 
355 360 365 

Ser Lys Asp Gin Lys Lys lie Lys Lys Lys Lys Ser Gly Gin lie Ser 
370 375 380 

Ser Glu Ser Lys Gin Lys Arg lie Lys Phe Asp Val Leu Leu Thr Ser 
385 390 395 400 

Tyr Glu Met lie Asn Leu Asp Ser Ala Val Leu Lys Pro lie Lys Trp 

405 410 415 

Glu Cys Met lie Val Asp Glu Gly His Arg Leu Lys Asn Lys Asp Ser 
420 425 430 

Lys Leu Phe Ser Ser Leu Thr Gin Tyr Ser Ser Asn His Arg lie Leu 
435 440 445 

Leu Thr Gly Thr Pro Leu Gin Asn Asn Leu Asp Glu Leu Phe Met Leu 
450 455 460 

Met His Phe Leu Asp Ala Gly Lys Phe Gly Ser Leu Glu Glu Phe Gin 
465 470 475 480 

Glu Glu Phe Lys Asp He Asn Gin Glu Glu Gin He Ser Arg Leu His 

485 490 495 

Lys Met Leu Ala Pro His Leu Leu Arg Arg Val Lys Lys Asp Val Met 
500 505 510 

Lys Asp Met Pro Pro Lys Lys Glu Leu lie Leu Arg Val Asp Leu Ser 
515 520 525 

Ser Leu Gin Lys Glu Tyr Tyr Lys Ala He Phe Thr Arg Asn Tyr Gin 
530 535 540 
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Val Leu Thr Lys Lys Gly Gly Ala Gin lie Ser Leu Asn Asn lie Met 
545 ' 550 555 560 

Met Glu Leu Arg Lys Val Cys Cys His Pro Tyr Met Leu Glu Gly Val 

565 570 575 

Glu Pro Val lie His Asp Ala Asn Glu Ala Phe Lys Gin Leu Leu Glu 
580 585 590 

Ser Cys Gly Lys Leu Gin Leu Leu Asp Lys Met Met Val Lys Leu Lys 
595 600 605 

/Glu Gin Gly His Arg Val Leu lie Tyr Thr Gin Phe Gin His Met Leu 
610 615 620 

Asp Leu Leu Glu Asp Tyr Cys Thr His Lys Lys Trp Gin Tyr Glu Arg 
625 630 635 640 

lie Asp Gly Lys Val Gly Gly Ala Glu Arg Gin lie Arg lie Asp Arg 

645 650 655 

Phe Asn Ala Lys Asn Ser Asn Lys Phe Cys Phe Leu Leu Ser Thr Arg 
660 665 670 

j Ala Gly Gly Leu Gly lie Asn Leu Ala Thr Ala Asp Thr Val lie lie 
675 680 685 

Tyr Asp Ser Asp Trp Asn Pro His Ala Asp Leu Gin Ala Met Ala Arg 
690 695 700 

Ala His Arg Leu Gly Gin Thr Asn Lys Val Met lie Tyr Arg Leu lie 
705 710 715 ~ 720 

Asn Arg Gly Thr lie Glu Glu Arg Met Met Gin Leu Thr Lys Lys Lys 

725 730 735 



Hi 



Met Val Leu Glu His 



Leu Val 



Val Gly Lys 
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745 



750 



Asn Gin Glu Glu Leu Asp Asp lie lie Arg Tyr Gly Ser Lys Glu Leu 
755 760 765 

Phe Ala Ser Glu Asp Asp Glu Ala Gly Lys Ser Gly Lys lie His Tyr 
770 775 780 



Asp Asp Ala Ala lie Asp Lys Leu Leu Asp Arg Asp Leu Val Glu Ala 
785 790 795 800 



Glu Glu Val Ser Val Asp Asp Glu Glu Glu Asn Gly Phe Leu Lys Ala 

805 810 815 

Phe Lys Val Ala Asn Phe Glu Tyr He Asp Glu Asn Glu Ala Ala Ala 

820 825 830 



Leu Glu Ala Gin Arg Val Ala Ala Glu Ser Lys Ser Ser Ala Gly Asn 
835 840 845 

Ser Asp Arg Ala Ser Tyr Trp Glu Glu Leu Leu Lys Asp Lys Phe Glu 
850 855 860 

Leu His Gin Ala Glu Glu Leu Asn Ala Leu Gly Lys Arg Lys Arg Ser 
865 870 875 880 

Arg Lys Gin Leu Val Ser He Glu Glu Asp Asp Leu Ala Gly Leu Glu 

885 890 895 

Asp Val Ser Ser Asp Gly Asp Glu Ser Tyr Glu Ala Glu Ser Thr Asp 
900 905 910 

Gly Glu Ala Ala Gly Gin Gly Val Gin Thr Gly Arg Arg Pro Tyr Arg 
915 920 925 

Arg Lys Gly Arg Asp Asn Leu Glu Pro Thr Pro Leu Met Glu Gly Glu 
930 935 940 
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Gly Arg Ser Phe Arg Val Leu Gly Phe Asn Gin Ser Gin Arg Ala lie 
945 950 955 960 

Phe Val Gin Thr Leu Met Arg Tyr Gly Ala Gly Asn Phe Asp Trp Lys 

965 970 975 

Glu Phe Val Pro Arg Leu Lys Gin Lys Thr Phe Glu Glu lie Asn Glu 
980 985 990 

Tyr Gly lie Leu Phe Leu Lys His lie Ala Glu Glu lie Asp Glu Asn 
995 1000 1005 

Ser Pro Thr Phe Ser Asp Gly Val Pro Lys Glu Gly Leu Arg lie 
1010 1015 1020 

Glu Asp Val Leu Val Arg lie Ala Leu Leu lie Leu Val Gin Glu 
1025 1030 1035 

Lys Val Lys Phe Val Glu Asp His Pro Gly Lys Pro Val Phe Pro 
1040 1045 1050 

Ser Arg lie Leu Glu Arg Phe Pro Gly Leu Arg Ser Gly Lys lie 
1055 1060 1065 

Trp Lys Glu Glu His Asp Lys lie Met lie Arg Ala Val Leu Lys 
1070 1075 1080 

His Gly Tyr Gly Arg Trp Gin Ala lie Val Asp Asp Lys Glu Leu 
1085 1090 1095 

Gly He Gin Glu Leu He Cys Lys Glu Leu Asn Phe Pro His lie 
1100 1105 1110 

Ser Leu Ser Ala Ala Glu Gin Ala Gly Leu Gin Gly Gin Asn Gly 
1115 1120 1125 

Ser Gly Gly Ser Asn Pro Gly Ala Gin Thr Asn Gin Asn Pro Gly 
1130 1135 1140 
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Ser Val lie Thr Gly Asn Asn Asn Ala Ser Ala Asp Gly Ala Gin 
1145 1150 1155 



Val Asn Ser Met Phe Tyr Tyr Arg Asp Met Gin Arg Arg Leu Val 
1160 1165 1170 



Glu Phe Val Lys Lys Arg Val Leu Leu Leu Glu Lys Ala Met Asn 
1175 1180 1185 



Tyr Glu Tyr Ala Glu Glu Tyr Tyr Gly Leu Gly Gly Ser Ser Ser 
1190 1195 1200 



lie Pro Thr Glu Glu Pro Glu Ala Glu Pro Lys lie Ala Asp Thr 
1205 1210 1215 



Val Gly Val Ser Phe lie Glu Val Asp Asp Glu Met Leu Asp Gly 
1220 1225 1230 



Leu Pro Lys Thr Asp Pro lie Thr Ser Glu Glu lie Met Gly Ala 
1235 1240 1245 



Ala Val Asp Asn Asn Gin Ala Arg Val Glu lie Ala Gin His Tyr 
1250 1255 1260 



Asn Gin Met Cys Lys Leu Leu Asp Glu Asn Ala Arg Glu Ser Val 
1265 1270 1275 

Gin Ala Tyr Val Asn Asn Gin Pro Pro Ser Thr Lys Val Asn Glu 
1280 1285 1290 

Ser Phe Arg Ala Leu Lys Ser lie Asn Gly Asn lie Asn Thr lie 
1295 1300 1305 



Leu Ser lie Thr Ser Asp Gin Ser Lys Ser His Glu Asp Asp Thr 
1310 1315 1320 



Lys Pro Asp Leu Asn Asn Val Glu Met Lys Asp Thr Ala Glu Glu 
1325 1330 1335 
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Thr Lys Pro Leu Arg Gly Gly Val Val Asp Leu Asn Val Val Glu 
1340 1345 1350 

Gly Glu Glu Asn lie Ala Glu Ala Ser Gly Ser Val Asp Val Lys 
1355 1360 1365 

Met Glu Glu Ala Lys Glu Glu Glu Lys Pro Lys Asn Met Val Val 
1370 1375 1380 

Asp 

<210> 3 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 
<222> 18-19 

<223> AFLP Primer EcoRI for AFLP Mapping Analysis in Example 1; 
n may be a, g, c or t 

<400> 3 

agactgcgta ccatttcnn 19 

<210> 4 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 
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<222> 17-19 

<223> AFLP Primer Msel for AFLP Mapping Analysis in Example 1 ,- 
n may be a, g, c or t 

<400> 4 

gatgagtcct gagtaannn 19 

<210> 5 
<211> 21 

<212> DNA 

<213> Arabidopsis thaliana 
) <220> 

<223> Primers for PCR of Example 2 ; 

sequence complementary to nucleotides 1725-1745 of SEQ ID NO:l 

<400> 5 

tgttgagcca gttattcacg a 21 



<210> 6 
<211> 21 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<223> Primers for PCR of Example 2; 

sequence complementary to nucleotides 1934-1914 in SEQ ID NO:l 

<400> 6 

acctttccat caattcgctc g 21 

<210> 7 
<211> 30 

<212> DNA 
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<213> Artificial Sequence 
<220> 

<221> misc_f eature 

<222> 1-30 

<223> Primers for PCR of Example 2 

<400> 7 

ccgctcgaga accccaatga ccagctcagt 3 0 

<210> 8 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_f eature 
<222> 1-21 

<223> Primers for PCR of Example 2; 

sequence complementary to nucleotides 672-652 of 
LEC1 cDNA sequence 

<400> 8 

ccttcttcac ttatactgac c 21 

<210> 9 
<211> 21 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<223> Primers for PCR of Example 2; 

nucleotides 65-85 of ROC3 cDNA sequence 
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<400> 9 

aagtctactt cgacatgacc g 21 



<210> 10 

<211> 21 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

.-^<223> Primers for PCR of Example 2; 

> j sequence complementary to nucleotides 524-504 of ROC3 

cDNA sequence 

<400> 10 

cttccagagt cagatccaac c 21 

<210> 11 
<211> 30 
<212> DNA 

<213> Arabidopsis thaliana 
- <22 0> 

<223> Primers for PCR of Example 4; 

represent nucleotides 895-924 in SEQ ID NO : 1 wherein nucleotide 
907 is changed from "a" to "g" 

<400> 11 

gaaatgggac taggcaggac aattcaaagc 3 0 

<210> 12 
<211> 30 
<212> DNA 
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<213> Arabidopsis thaliana 
<220> 

<223> Primers for PCR of Example 4; 

represent sequence complementary to nucleotides 

924-895 in SEQ ID NO : 1 , with nucleotide 911 changed from "t" to 

"c" . 

<400> 12 

gctttgaatt gtcctgccta gtcccatttc 30 

<210> 13 
-^<211> 47 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 
<222> 1-47 

<223> Primers for PCR of Example 4 
<400> 13 

aagccaaaga acatggtcgt tgatctagag gatcctgaag ctcgaaa 47 

<210> 14 
<211> 52 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 
<222> 1-52 

<223> Primers for PCR of Example 4 
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<400> 14 

gaatcttgat ttaccagttg agtcattttt gatgaaacag aagctttttg at 52 

<210> 15 
<211> 21 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<223> Primers for PCR of Example 4 ; 
^ represent sequence complementary to nucleotides 4152-4132 in SEQ 

J ID NO : 1 

<400> 15 

atcaacgacc atgttctttg g 21 

<210> 16 
<211> 22 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<223> Primers for PCR of Example 4; 

represent nucleotides 4153-4174 in SEQ ID NO:l 

<400> 16 

tgactcaact ggtaaatcaa ga 22 

<210> 17 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<221> misc_f eature 
<222> 1-30 

<223> Primers for PCR of Example 5 
<400> 17 

ccgctcgagt gagtagtttg gtggagaggc 3 0 

<210> 18 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_f eature 
<222> 1-30 

<223> Primers for PCR of Example 5 
<400> 18 

ccggaattcc atcggaggaa ccttgttcac 3 0 

<210> 19 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 
<222> 1-30 

<223> Primers for PCR of Example 5 
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<400> 



19 



cgcggatccc atcggaggaa ccttgttcac 



30 



<210> 20 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_f eature 

J)<222> 1-3 0 

<22 3> Primers for PCR of Example 5 
<400> 20 

tgctctagat gagtagtttg gtggagaggc 3 0 

<210> 21 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
- <220> 
<221> misc_feature 
<222> 1-30 

<223> Primers for PCR of Example 5 
<400> 21 

ccgctcgagc cctcacataa gtttgtctgc 3 0 

<210> 22 
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<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 
<222> 1-30 

<223> Primers for PCR of Example 5 
<400> 22 

ccggaattcg tcttaggaag tccatcaagc 3 0 

<210> 23 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 
<222> 1-30 

<223> Primers for PCR of Example 5 
<400> 23 

cgcggatccg tcttaggaag tccatcaagc 3 0 

<210> 24 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 
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<222> 1-30 



<223> Primers for PCR of Example 5 



<400> 



24 



tgctctagac cctcacataa gtttgtctgc 



30 



<210> 25 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
)<220> 

<221> misc_feature 

<222> 1-31 

<223> Primers for PCR of Example 6 

<400> 25 

cgcggatcct ttttccactt ctcagtccgg g 31 

<210> 26 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_f eature 

<222> 1-34 

<223> Primers for PCR of Example 4 

<400> 26 

cttcgaactc gagggatccc catggctagc agct 34 
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<210> 27 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_f eature 
<222> 1-34 

<223> Primers for PCR of Example 4 
~^<400> 27 

gctagccatg gggatccctc gagttcgaag gtac 34 

<210> 28 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 
<222> 1-12 

<223> Primers for forming cassette inserted into pCAMBIA3 3 00 
in Example 4 

<400> 28 

ccaggtacct gg 12 

<210> 29 
<211> 20 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<221> misc_f eature 
<222> 1-20 

<223> Primers for forming cassette inserted into pCAMBIA3300 
in Example 4 

<400> 29 

aattccaggt acctggcatg 20 
<210> 3 0 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 
<222> 1-38 

<223> Sequence for forming clone of the rat glucocorticoid receptor 
in Example 4 

<400> 30 

tctagaggat cc tgaagctc gaaaaacaaa gaaaaaaa 3 8 
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